Welcome to the fourth project of the Machine Learning Engineer Nanodegree! In this notebook, template code is provided to help you analyze the Smartcab and the learning algorithm you implement. You will not need to modify the provided code beyond what is requested. You will have to answer some questions about the project and the visualizations provided in the notebook. Each section where you must answer a question is preceded by a header of the form 'Question X'. Read each question carefully and write thorough answers in the text boxes that follow, which begin with 'Answer:'. Your submission will be evaluated on your answers to each question and on the implementation you provide in the agent.py script.
Note: Code and Markdown cells can be executed using the Shift + Enter keyboard shortcut. In addition, Markdown cells can typically be edited by double-clicking them, which enters edit mode.
In this project, you will build an optimized Q-Learning driving agent that must navigate a Smartcab through its environment towards a goal. Since a Smartcab is expected to take passengers from one place to another, the driving agent will be evaluated on two very important metrics: Safety and Reliability. A driving agent that gets the Smartcab to its destination while running red lights or narrowly avoiding accidents would be considered unsafe. Similarly, a driving agent that frequently fails to reach its destination in time would be considered unreliable. Maximizing the driving agent's safety and reliability would ensure that Smartcabs have a permanent place in the transportation industry.
Safety and Reliability are measured using the letter-grade system shown below:
| Grade | Safety | Reliability |
|---|---|---|
| A+ | The agent commits no traffic violations and always chooses the correct action. | The agent reaches its destination in time on 100% of trips. |
| A | The agent commits few minor traffic violations, such as failing to move on a green light. | The agent reaches its destination in time on at least 90% of trips. |
| B | The agent commits several minor traffic violations, such as failing to move on a green light. | The agent reaches its destination in time on at least 80% of trips. |
| C | The agent commits at least one major traffic violation, such as driving through a red light. | The agent reaches its destination in time on at least 70% of trips. |
| D | The agent causes at least one minor accident, such as turning left on green with oncoming traffic. | The agent reaches its destination in time on at least 60% of trips. |
| F | The agent causes at least one major accident, such as driving through a red light at an intersection. | The agent fails to reach its destination in time on at least 60% of trips. |
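The grading table can be read as two small lookup rules. The sketch below is only an illustration of that reading, not the project's grading code: the function names and the `few_limit` parameter are hypothetical. Two assumptions are made loudly here. First, the table does not say where "few" minor violations ends and "several" begins, so `few_limit` is an arbitrary cutoff. Second, the table leaves on-time rates between 40% and 60% unspecified, and this sketch treats anything below 60% as F. The A+ safety requirement of always choosing the correct action is not checked.

```python
def reliability_grade(success_rate):
    """Map the fraction of trips finished in time (0.0 to 1.0) to a grade."""
    if success_rate >= 1.0:
        return "A+"
    # Thresholds taken from the Reliability column of the table.
    for threshold, grade in [(0.90, "A"), (0.80, "B"), (0.70, "C"), (0.60, "D")]:
        if success_rate >= threshold:
            return grade
    # Assumption: everything below 60% is graded F.
    return "F"


def safety_grade(minor_violations, major_violations,
                 minor_accidents, major_accidents, few_limit=2):
    """Grade safety by the worst kind of event that occurred.

    few_limit is a hypothetical cutoff between "few" (A) and
    "several" (B) minor violations; the table does not define it.
    """
    if major_accidents > 0:
        return "F"
    if minor_accidents > 0:
        return "D"
    if major_violations > 0:
        return "C"
    if minor_violations == 0:
        return "A+"
    return "A" if minor_violations <= few_limit else "B"
```

For example, an agent that arrives in time on 17 of 20 trips and runs one red light without causing an accident would get `reliability_grade(17 / 20.0)` of "B" and a `safety_grade(0, 1, 0, 0)` of "C" under these assumptions.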
To help evaluate these important metrics, you need to load visualization code that will be used later in the project. Run the code cell below to import this code, which is required for your analysis.
# Import the visualization code
import visuals as vs
# Pretty display for Jupyter notebooks
%matplotlib inline
Before you start implementing your driving agent, you need to understand the world (environment) in which the Smartcab and the driving agent operate. One of the most important parts of building a self-learning agent is understanding its characteristics, including how the agent operates. To begin, simply run the agent.py script, which contains the starter code for the driving agent, exactly as it is; no changes are needed yet. Let the simulation run for a while to see the various working components in action. Note that in the visual simulation (if enabled), the white car is the Smartcab.
In a few sentences, describe what you observe during the simulation of the driving agent when running the unmodified agent.py code. Some things you should consider:
Hint: From the top-level /smartcab/ folder (where this notebook is located), run the command
'python smartcab/agent.py'
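When describing what you observe, it can help to summarize the simulator's console output instead of reading it line by line. The sketch below assumes the message format seen in this notebook's trial output, where each step ends with `(rewarded X)`. The helper names `classify` and `summarize` are hypothetical, and the mapping from message text to the categories of the grading table is my own reading of the messages, not something the simulator reports.

```python
import re
from collections import Counter

# Matches the reward at the end of a step message, e.g. "(rewarded -20.82)".
REWARD_RE = re.compile(r"\(rewarded (-?\d+(?:\.\d+)?)\)")


def classify(line):
    """Assumed mapping from a step message to a grading-table category."""
    if "major accident" in line:
        return "major accident"
    if "minor accident" in line:
        return "minor accident"
    if "attempted driving" in line and "red light" in line:
        return "major violation"
    if "idled at a green light with no oncoming traffic" in line:
        return "minor violation"
    return "ok"


def summarize(log_text):
    """Return the total reward and a count of each category in the log."""
    total = 0.0
    counts = Counter()
    for line in log_text.splitlines():
        match = REWARD_RE.search(line)
        if match is None:
            continue  # Not a step-result line.
        total += float(match.group(1))
        counts[classify(line)] += 1
    return total, counts


# A few step messages copied from the trial output in this notebook.
sample = """\
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.82)
Agent drove forward instead of right. (rewarded 1.34)
Agent properly idled at a red light. (rewarded 1.83)
Agent attempted driving forward through a red light. (rewarded -10.38)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.98)
Agent idled at a green light with no oncoming traffic. (rewarded -5.91)
"""

total, counts = summarize(sample)
```

Matching on substrings keeps the sketch short, but it is brittle: if the simulator's wording changes, the categories would need to be updated.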
!pip install pygame
Requirement already satisfied: pygame in /opt/conda/lib/python2.7/site-packages
%run ./smartcab/agent.py
Simulator.__init__(): Error initializing GUI objects; display disabled.
error: No available video device
/-------------------------
| Training trial 1
\-------------------------
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.82)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.34)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 0.39)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.38)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.98)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.91)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.78)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.18)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.35)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.49)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.81)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.16)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.26)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.41)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.12)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.76)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.20)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.06)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove forward instead of right. (rewarded 0.42)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.19)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.31)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.22)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.77)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.24)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded -0.41)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 1.47)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.89)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 2
\-------------------------
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.78)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.36)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.32)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.59)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'left')
Agent drove forward instead of right. (rewarded 0.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.65)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.79)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.27)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.50)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.01)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.44)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.99)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.66)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.19)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.81)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 3
\-------------------------
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.53)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.56)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.24)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent drove right instead of forward. (rewarded 0.92)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.35)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -11.00)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.20)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.68)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.70)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent followed the waypoint left. (rewarded 1.73)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 1.24)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.04)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.14)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove forward instead of right. (rewarded 0.98)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.43)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.56)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.86)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.34)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -9.21)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent drove right instead of forward. (rewarded 1.36)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.63)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.59)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.23)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.85)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.02)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 4
\-------------------------
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.72)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.19)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.11)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.13)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent idled at a green light with oncoming traffic. (rewarded 0.08)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.55)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.85)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.14)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.00)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.35)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent followed the waypoint right. (rewarded 2.54)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.18)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.27)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.06)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.60)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.30)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.57)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.47)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.78)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.85)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded -0.22)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.93)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.49)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.29)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded -0.18)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.52)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.67)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 5
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.66)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.42)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.44)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.64)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.30)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.10)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.95)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.72)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.19)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.23)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.13)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.71)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.95)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.27)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.45)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.29)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -10.75)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'left')
Agent drove right instead of left. (rewarded 1.10)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 0.94)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.53)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 6
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.54)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent drove left instead of right. (rewarded 1.95)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.63)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent idled at a green light with oncoming traffic. (rewarded -0.01)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.79)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.70)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.66)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.58)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.54)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.05)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.08)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.41)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.59)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 0.81)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.16)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 7
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent followed the waypoint left. (rewarded 1.33)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 0.44)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.99)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.11)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.32)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.07)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 0.91)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 1.56)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.40)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.31)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.80)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.11)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.37)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.33)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.47)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.86)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove forward instead of left. (rewarded 0.69)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.35)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.62)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent drove right instead of left. (rewarded 0.02)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.85)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 8
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.48)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.77)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.80)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 0.93)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -9.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.52)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.51)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.24)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.20)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.15)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded -0.26)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded -0.30)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with oncoming traffic. (rewarded 0.30)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.60)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 0.41)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.13)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove forward instead of right. (rewarded -0.54)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 9
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 0.95)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.17)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.16)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.39)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent idled at a green light with oncoming traffic. (rewarded 1.31)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.36)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.92)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.94)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.22)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.22)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.04)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.45)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.84)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.40)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.66)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove left instead of forward. (rewarded 0.42)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.52)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.98)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.53)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 10
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove left instead of right. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.22)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 1.48)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.78)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'right')
Agent attempted driving left through a red light. (rewarded -10.53)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove forward instead of left. (rewarded 0.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent drove right instead of left. (rewarded -0.10)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded -0.03)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -10.83)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.11)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.10)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 11
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.26)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.42)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -10.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with oncoming traffic. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.02)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.32)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.45)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.15)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 0.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.36)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.63)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.95)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.09)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.20)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 12
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.60)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.15)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -10.09)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.35)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with oncoming traffic. (rewarded 1.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.61)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.23)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.27)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.58)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent idled at a green light with oncoming traffic. (rewarded 0.59)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.04)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.10)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 0.88)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.85)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.50)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 13
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'forward')
Agent followed the waypoint right. (rewarded 2.62)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'left')
Agent drove left instead of right. (rewarded 0.64)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.87)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.65)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.52)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 1.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 0.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.22)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded -0.02)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.79)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.28)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 14
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.47)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.97)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.77)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.49)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.97)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.95)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.52)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.79)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.26)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.31)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.53)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent drove right instead of left. (rewarded 1.61)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.17)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.41)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.76)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded -0.39)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.64)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.81)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.75)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.07)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 15
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent idled at a green light with oncoming traffic. (rewarded 1.88)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.14)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.74)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.57)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.77)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.45)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 0.43)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.97)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.12)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.00)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.92)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded -0.11)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.73)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.50)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.74)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.17)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 0.64)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.04)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.68)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded -0.13)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.11)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.02)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded -0.41)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.80)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.48)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded -0.14)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 16
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.99)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.12)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.87)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.33)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent followed the waypoint left. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove right instead of left. (rewarded 1.22)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.01)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.34)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.98)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -9.67)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.53)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.67)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove right instead of left. (rewarded 0.51)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.20)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.26)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.31)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.63)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent idled at a green light with oncoming traffic. (rewarded 1.29)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.42)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.98)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.79)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.62)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.51)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.81)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent idled at a green light with oncoming traffic. (rewarded 0.38)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.90)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.75)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.53)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 17
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.49)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.25)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove forward instead of left. (rewarded 1.86)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.34)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.87)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove forward instead of left. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with oncoming traffic. (rewarded 1.51)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.57)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.27)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.02)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.10)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent drove forward instead of right. (rewarded 0.93)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent drove right instead of forward. (rewarded -0.27)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.24)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.06)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.58)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove right instead of forward. (rewarded -0.54)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.75)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 0.01)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 18
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.59)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.89)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 1.29)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 1.24)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.08)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.14)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.69)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.54)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.28)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.93)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light. (rewarded -9.14)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.37)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.71)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.37)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.78)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.19)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.61)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.56)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.04)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.06)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove forward instead of left. (rewarded 1.14)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.78)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.62)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.79)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent drove right instead of left. (rewarded -0.39)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.18)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.35)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.48)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 19
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.26)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.24)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.31)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light. (rewarded -9.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.90)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.93)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.42)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.84)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.20)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.38)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent attempted driving left through a red light. (rewarded -10.38)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove left instead of forward. (rewarded 0.11)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.71)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.91)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -10.62)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.65)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded -0.21)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'forward')
Agent idled at a green light with oncoming traffic. (rewarded 0.30)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 20
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.53)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove left instead of right. (rewarded 0.78)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -9.10)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.93)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.50)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.03)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.40)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.73)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.47)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.32)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.30)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 0.32)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent drove right instead of left. (rewarded 0.97)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.82)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.28)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.59)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.70)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.64)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent followed the waypoint right. (rewarded 0.92)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.03)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.26)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.26)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.64)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove left instead of forward. (rewarded 0.88)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.90)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.86)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.84)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 1
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 1.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.41)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.26)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.36)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.68)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.79)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.84)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.37)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded 1.64)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 2.55)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent drove right instead of left. (rewarded -0.16)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.53)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent drove forward instead of right. (rewarded 0.98)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.76)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 2
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.24)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.34)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.36)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.60)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.84)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent drove forward instead of right. (rewarded 0.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 1.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.20)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.98)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 1.05)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove forward instead of right. (rewarded 1.39)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.57)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'right')
Agent properly idled at a red light. (rewarded 0.42)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.51)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove forward instead of right. (rewarded 0.80)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.37)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 3
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -4.47)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.13)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 1.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.22)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -10.22)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.07)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 0.90)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent followed the waypoint left. (rewarded 1.33)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.16)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.28)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light. (rewarded -9.05)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.34)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.04)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.62)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.18)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.31)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.40)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 1.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded -0.32)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'forward')
Agent idled at a green light with oncoming traffic. (rewarded 0.89)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'forward')
Agent drove left instead of right. (rewarded -0.62)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 4
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.10)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.14)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.97)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.64)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.55)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 1.13)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent followed the waypoint left. (rewarded 1.40)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.37)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.08)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 0.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.94)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.58)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.83)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.13)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.06)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 0.18)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.57)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.79)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.52)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.97)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 1.31)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.76)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.11)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded -0.51)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.21)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 0.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 5
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.22)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 1.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.76)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.71)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.60)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.49)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with oncoming traffic. (rewarded 0.26)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.51)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.03)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.20)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.77)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.63)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 0.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 6
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'right')
Agent properly idled at a red light. (rewarded 2.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.92)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.28)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.19)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.22)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded 1.17)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.26)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.60)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.49)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with oncoming traffic. (rewarded 0.64)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.96)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.37)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.41)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.75)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 7
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.04)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.80)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.79)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.45)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.79)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.50)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.44)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.40)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.74)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.60)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.34)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.75)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.17)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.08)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent drove right instead of forward. (rewarded 1.11)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 8
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.84)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.59)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.05)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.89)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.12)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.97)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.85)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.67)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.90)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.44)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.04)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.76)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.93)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent drove right instead of left. (rewarded 1.73)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.05)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.68)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.68)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.79)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.86)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.73)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.64)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.79)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.07)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.02)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.03)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 9
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.13)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.00)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.67)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.63)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.23)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.57)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent properly idled at a red light. (rewarded 1.55)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.08)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.44)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.97)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.04)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.14)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.98)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.05)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.15)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.13)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent idled at a green light with oncoming traffic. (rewarded 0.08)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.32)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.65)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.78)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.24)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.44)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove left instead of right. (rewarded 0.17)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.45)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 10
\-------------------------
Simulating trial. . .
Agent not set to learn.
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.22)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove right instead of forward. (rewarded 0.14)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.57)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.69)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.19)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.42)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light. (rewarded -9.17)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.19)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.13)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.53)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.29)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
Simulation ended. . .
Answer:
The Smartcab stays idle and its state is not updated during the simulation.
The driving agent receives positive rewards while the Smartcab is stopped at a red light. When the light turns green and there is no traffic, the agent stays stopped and the reward becomes negative. When the light turns red again, the rewards become positive once more.
This behavior is observed throughout the simulation.
In addition to understanding the world, it is also necessary to understand the code itself that governs how the world, the simulation, and everything else operate. Attempting to create a driving agent would be difficult without having at least explored the "hidden" components that make everything work. In the top-level /smartcab/ directory there are two other folders: /logs/ (which will be used later) and /smartcab/. Open the inner /smartcab/ folder, explore each Python file it contains, and answer the following question.
- In agent.py, choose three flags that can be set and explain how they change the simulation.
- In environment.py, which function of the Environment class is called when an agent performs an action?
- In simulator.py, what is the difference between the 'render_text()' function and the 'render()' function?
- In planner.py, does the 'next_waypoint()' function consider the North-South or the East-West direction first?

Answer:
agent.py:
learning: Indicates whether the cab is in learning mode. If it is, the outcomes of its actions influence its decision-making process in later rounds.
epsilon: The exploration factor, which controls how often the cab takes a random action.
alpha: The learning rate, which controls how much the cab learns from each of its actions.
In environment.py, act() is the function called when an agent performs an action. It lets the agent act on the environment and receive the reward for that action.
In simulator.py, render_text() writes the simulation output to the command line, while render() displays it in a GUI window.
In planner.py, the East-West direction is considered before North-South.
The first step to creating an optimized Q-Learning driving agent is getting the agent to actually take valid actions. In this case, a valid action is one of None (do nothing), 'Left' (turn left), 'Right' (turn right), or 'Forward' (go forward). For your first implementation, navigate to the agent's 'choose_action()' function and make the driving agent randomly choose one of these actions. Note that you have access to several class variables that will help you write this functionality, such as 'self.learning' and 'self.valid_actions'. Once implemented, run the agent script and the simulation briefly to confirm that your agent is taking a random action at each time step.
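Random action selection can be sketched as follows. This is a minimal, standalone illustration rather than the project's actual 'choose_action()' method: the function name `choose_random_action` and the module-level `VALID_ACTIONS` list are assumptions standing in for 'self.valid_actions', and the lowercase action strings follow the states printed in the simulation log.

```python
import random

# Assumed stand-in for the agent's 'self.valid_actions' attribute.
VALID_ACTIONS = [None, 'forward', 'left', 'right']


def choose_random_action(valid_actions=VALID_ACTIONS, rng=random):
    """Return one valid action, chosen uniformly at random."""
    return rng.choice(valid_actions)


if __name__ == '__main__':
    # Each call is independent of the current state: nothing is learned.
    print([choose_random_action() for _ in range(5)])
```

Because the choice ignores the state entirely, an agent built this way is expected to run red lights and idle at green lights about as often as it does anything else.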
To obtain results from the initial simulation, you will need to adjust the following flags:

- 'enforce_deadline' - Set this to True to force the driving agent to record whether it reaches the destination in time.
- 'update_delay' - Set this to a small value (such as 0.01) to reduce the time between steps in each trial.
- 'log_metrics' - Set this to True to log the simulation results as a .csv file in the /logs/ folder.
- 'n_test' - Set this to '10' to run 10 testing trials.

Optionally, you may disable the visual simulation (which makes the trials run faster) by setting the 'display' flag to False. Flags that have been changed here should be returned to their original values if you need to debug the code. It is important that you understand what each flag does and how it affects the simulation!
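As a quick reference, the flag values requested above can be collected in one place. The dictionary below is only a hypothetical grouping for illustration; where each flag is actually passed inside agent.py is not shown in this notebook, and the `describe` helper is an invented name.

```python
# Hypothetical grouping of the flags named above (not the project's API).
initial_simulation_flags = {
    'enforce_deadline': True,  # record whether the destination is reached in time
    'update_delay': 0.01,      # small delay between steps
    'log_metrics': True,       # write results to a .csv file in /logs/
    'n_test': 10,              # number of testing trials
    'display': False,          # optional: disable the visual simulation
}


def describe(flags):
    """Format the flags as a single 'name=value' line, sorted by name."""
    return ', '.join('{}={!r}'.format(name, flags[name]) for name in sorted(flags))


if __name__ == '__main__':
    print(describe(initial_simulation_flags))
```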
Once you have successfully completed the initial simulation (20 training trials and 10 testing trials should have run), execute the code cell below to visualize the results. Note that log files are overwritten when identical simulations are run, so be careful about which log file is being loaded!
# Load the 'sim_no-learning' log file from the initial simulation results
vs.plot_trials('sim_no-learning.csv')
Using the visualization above that was produced from your initial simulation, provide an analysis and make several observations about the driving agent. Make sure that you include at least one observation about each panel in the visualization. Some things you could consider:
Answer:
1 - The driving agent made bad decisions almost 42% of the time, with 8-9% resulting in major accidents. However, the rate of bad decisions fell over time, settling at around 36%, while the rate of major accidents dropped below 5%.
2 - Given that the agent is driving randomly and is not learning, this result makes sense.
3 - The rewards the agent receives increase gradually over time, from about -6 at the start to about -4 at the end. The improvement in rewards is related to the decrease in major accidents, so the reward appears to track the agent's performance. The agent is therefore not penalized too heavily for its actions, which seems consistent with rewards that come from random behavior.
4 - Across the trials, the reliability rate stays at 0% up to trial 13; from trial 14 onward it holds steady at around 10%.
5 - This Smartcab would not be considered safe by its passengers, because its random actions result in a high accident rate.
The second step to creating an optimized Q-Learning driving agent is defining a set of states that the agent can occupy in the environment. Depending on the input, sensor data, and additional variables available to the driving agent, a set of states can be defined so that the agent can eventually learn which action to take when it occupies each state. The condition 'if state then action' for each state is called a policy, and it is ultimately what the driving agent is expected to learn. Without defining states, the driving agent would never understand which action is the best possible (or optimal) one, or even which environmental variables and conditions matter!
Inspecting the 'build_state()' function shows that the driving agent obtains the following data from the environment:
- 'waypoint', which is the direction the Smartcab should drive to reach the destination, relative to its current heading.
- 'inputs', which contains the Smartcab's sensor data, including:
  - 'light', the color of the active traffic light.
  - 'left', the intended direction of travel of a vehicle to the Smartcab's left. Its value is None if there is no vehicle to the left.
  - 'right', the intended direction of travel of a vehicle to the Smartcab's right. Its value is None if there is no vehicle to the right.
  - 'oncoming', the intended direction of travel of a vehicle across the intersection from the Smartcab. Its value is None if there is no oncoming vehicle.
- 'deadline', which is the number of actions remaining for the Smartcab to reach its destination before its time runs out.

Which features available to the agent are most relevant for learning both safety and efficiency? Why are these features appropriate for modeling the Smartcab in the environment? If you did not choose some of these features, why are they not appropriate?
Answer:
The following features can be considered the most relevant for the driving agent to learn safely and efficiently:
Safety:
inputs['light']: The driving agent must know the color of the light in order to learn that it is penalized for idling when the light is green and for crossing intersections on a red light, which causes traffic violations and accidents.
inputs['left'], inputs['right'], inputs['oncoming']: It is vital for the Smartcab to determine whether it is safe to go forward, turn right, or turn left; otherwise it may cause a collision.
Efficiency:
waypoint: The Smartcab must know which direction to follow so that it can head the right way to reach the desired destination.
Although the deadline matters for judging how reliable the Smartcab is at arriving within the allotted time, there are no complementary parameters that could be used to help meet it. Combined with the waypoint, however, this feature could be used to optimize the route taken.
When defining a set of states that the agent can occupy, it is necessary to consider the size of the state space. That is, if you expect the driving agent to learn a policy for each state, you would need to determine an optimal action for every state the agent can occupy. If the number of possible states is very large, the driving agent may never learn what to do in some states, which can lead to uninformed decisions. For example, consider the case where the following features are used to define the state of the Smartcab:
('is_raining', 'is_foggy', 'is_red_light', 'turn_left', 'no_traffic', 'previous_turn_left', 'time_of_day').
These stand for: it is raining, it is foggy, the light is red, turning left, no traffic, the previous turn was left, and the time of day.
How often do you imagine the agent would occupy a state like (False, True, True, True, False, False, '3AM')? Without a nearly infinite amount of training time, it is unlikely that the agent would ever learn the proper action!
If a state is defined using the features you selected in Question 4, what would be the size of the state space? Given what you know about the environment and how it is simulated, do you think the driving agent could learn a policy for each possible state within a reasonable number of training trials?
Hint: Consider the combinations of features to calculate the total number of states!
Answer:
Given the following features:
| Feature | Number of values | Values |
|---|---|---|
| waypoint | 3 | forward, left, right |
| inputs: light | 2 | red, green |
| inputs: left | 4 | None, forward, left, right |
| inputs: right | 4 | None, forward, left, right |
| inputs: oncoming | 4 | None, forward, left, right |
| TOTAL | 384 | 3 × 2 × 4 × 4 × 4 |
The total number of possible combinations is 3 × 2 × 4 × 4 × 4 = 384.
Since this is not a very large number of combinations, I believe the driving agent can learn a policy for each state within a reasonable number of training trials.
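As a quick check of the arithmetic, the state space can be enumerated directly. This is a minimal sketch that only restates the feature values from the table; the names `features`, `all_states`, and `n_states` are illustrative and are not part of the project code.

```python
from itertools import product

# Values each feature can take, copied from the table in the answer.
features = {
    "waypoint": ["forward", "left", "right"],
    "light": ["red", "green"],
    "left": [None, "forward", "left", "right"],
    "right": [None, "forward", "left", "right"],
    "oncoming": [None, "forward", "left", "right"],
}

# Every combination of one value per feature is one candidate state.
all_states = list(product(*features.values()))
n_states = len(all_states)
print(n_states)
```

Enumerating the combinations this way also makes it easy to see how quickly the count grows if another four-valued feature were added: the total would be multiplied by four.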
For your second implementation, navigate to the 'build_state()' agent function. With the justification you provided in Question 4, you will now set the 'state' variable to a tuple of all the features necessary for Q-Learning. Confirm that your driving agent is updating its state by running the simulation and checking that the state is displayed. If the visual simulation is enabled, confirm that the updated state matches what is shown in the simulation.
Note: Remember to reset the simulation flags to their default values when making this observation!
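The state tuple described above could be assembled roughly as follows. This is a sketch under assumptions, not the project's actual `build_state()`: the function name `build_state_sketch`, the dictionary keys of `inputs`, and the ordering of the three traffic entries inside the tuple are all hypothetical.

```python
def build_state_sketch(waypoint, inputs):
    """Pack the chosen features into a hashable tuple.

    Assumes `inputs` is a dict with the keys 'light', 'left', 'right'
    and 'oncoming'; the ordering inside the tuple is an illustrative choice.
    """
    return (
        waypoint,
        inputs["light"],
        inputs["left"],
        inputs["right"],
        inputs["oncoming"],
    )


# Hypothetical sensor reading, used only to exercise the function.
example_inputs = {"light": "red", "left": None, "right": "forward", "oncoming": "forward"}
state = build_state_sketch("left", example_inputs)
print(state)
```

A tuple is used because it is hashable, which lets the state serve directly as a dictionary key in the Q-table.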
The third step in creating an optimized Q-Learning driving agent is to begin implementing the learning algorithm itself. The idea behind Q-Learning is relatively simple: for every state the agent visits, create an entry in the Q-table for all available state-action pairs. Then, when the agent encounters a state and performs an action, update the Q-value associated with that state-action pair based on the reward received and the iterative update rule implemented. Q-Learning also brings an additional benefit: the agent can choose the best action for each state based on the Q-values of each possible state-action pair. For this project, you will implement a decaying $\epsilon$-greedy Q-Learning algorithm with no discount factor. Follow the implementation instructions marked TODO in the agent functions.
Note that the agent attribute self.Q is a Python dictionary: this is how the Q-table will be formed. Each state will be a key of the self.Q dictionary, and the value associated with each key will be another dictionary that stores the action as the key and the Q-value as the value. The example below illustrates this layout:
{ 'state-1': {
'action-1' : Qvalue-1,
'action-2' : Qvalue-2,
...
},
'state-2': {
'action-1' : Qvalue-1,
...
},
...
}
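The nested-dictionary layout and an update without a discount factor can be sketched as below. This is a minimal illustration, not the project's implementation: the helper names `create_q_entry` and `learn`, the zero initial Q-value, and the action set `VALID_ACTIONS` are assumptions. The update used is the common form Q(s, a) ← (1 − α)·Q(s, a) + α·reward, which drops the future-value term entirely.

```python
# Assumed action set: idle (None) or drive forward, left, right.
VALID_ACTIONS = [None, "forward", "left", "right"]


def create_q_entry(Q, state):
    """Add a row of zero-initialised Q-values for a state not seen before."""
    if state not in Q:
        Q[state] = {action: 0.0 for action in VALID_ACTIONS}


def learn(Q, state, action, reward, alpha):
    """Blend the old Q-value with the new reward; no discounted future term."""
    create_q_entry(Q, state)
    Q[state][action] = (1 - alpha) * Q[state][action] + alpha * reward


Q = {}
s = ("forward", "red", None, None, None)
learn(Q, s, None, 2.0, alpha=0.5)  # 0.5 * 0.0 + 0.5 * 2.0 = 1.0
learn(Q, s, None, 2.0, alpha=0.5)  # 0.5 * 1.0 + 0.5 * 2.0 = 1.5
print(Q[s])
```

With a constant reward, repeated updates move the Q-value toward that reward, and alpha controls how quickly.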
Furthermore, note that you are expected to use a decaying $\epsilon$ (exploration) factor. As the number of trials increases, $\epsilon$ should decay toward 0. This is done so that the agent learns from its behavior and begins acting on what it has learned, rather than continuing to prioritize exploring new possibilities. Additionally, the agent will be tested on what it has learned once $\epsilon$ falls below a certain threshold (the default threshold is 0.01). For the initial Q-Learning implementation, you will implement a linear decay function for $\epsilon$.
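The role of $\epsilon$ in action selection can be sketched as follows. This is an illustrative example, not the project's `choose_action()`: the function signature, the `rng` parameter, and the random tie-breaking among equally good actions are assumptions.

```python
import random


def choose_action(q_row, epsilon, rng=random):
    """Explore with probability epsilon, otherwise exploit the best Q-value.

    `q_row` maps each action to its Q-value for the current state.
    """
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.choice(list(q_row))
    # Exploit: pick among the actions tied for the highest Q-value.
    best = max(q_row.values())
    best_actions = [action for action, q in q_row.items() if q == best]
    return rng.choice(best_actions)


# Hypothetical Q-values for a single state.
q_row = {None: 1.5, "forward": -10.0, "left": -9.0, "right": 0.2}
greedy_action = choose_action(q_row, epsilon=0.0)
random_action = choose_action(q_row, epsilon=1.0)
print(greedy_action, random_action)
```

With `epsilon=0.0` the agent always exploits, and with `epsilon=1.0` it always explores, which matches the purely random behaviour seen early in training.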
To obtain results from the initial Q-Learning implementation, you will need to adjust the following flags:
- 'enforce_deadline' - Set this to True to force the driving agent to record whether it reaches the destination in time.
- 'update_delay' - Set this to a small value (such as 0.01) to reduce the time between steps in each trial.
- 'log_metrics' - Set this to True to log the simulation results as a .csv file and the Q-table as a .txt file in /logs/.
- 'n_test' - Set this to '10' to perform 10 testing trials.
- 'learning' - Set this to 'True' to tell the driving agent to use your Q-Learning implementation.

In addition, use the following decay function for $\epsilon$:
$$ \epsilon_{t+1} = \epsilon_{t} - 0.05, \hspace{10px}\textrm{for trial number } t$$
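A short sketch shows how many training trials this linear decay allows before $\epsilon$ drops below the testing threshold. It assumes a starting value of $\epsilon = 1.0$ (consistent with the 0.95 printed for the first training trial) and the default threshold of 0.01; the function name `trials_until_testing` is hypothetical.

```python
def trials_until_testing(epsilon=1.0, step=0.05, tolerance=0.01):
    """Count linear decay steps until epsilon falls below the tolerance."""
    trials = 0
    while epsilon >= tolerance:
        epsilon -= step  # epsilon_{t+1} = epsilon_t - 0.05
        trials += 1
    return trials


n_trials = trials_until_testing()
print(n_trials)
```

Under these assumptions the count is 20, which is small compared with the 384 possible states, so many states are likely to be visited rarely or never during this initial training run.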
If you have difficulty getting your implementation to work, try setting the 'verbose' flag to True to get more information for debugging. Flags that have been changed here should be returned to their default values when debugging. It is important that you understand what each flag does and how it affects the simulation!
Once you have successfully completed the initial Q-Learning simulation, run the code cell below to visualize the results. Note that log files are overwritten when identical simulations are run, so be careful with which log file is being loaded!
#%run ./smartcab/agent.py
Simulator.__init__(): Error initializing GUI objects; display disabled.
error: No available video device
/-------------------------
| Training trial 1
\-------------------------
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
Simulating trial. . .
espilon = 0.9500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.15)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.12)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.77)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.47)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.84)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.41)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 0.00)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.92)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.81)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.11)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.10)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.78)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.15)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.70)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.58)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.38)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.73)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.25)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.88)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 0.05)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.38)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.58)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.55)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.29)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.79)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.32)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.15)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 2
\-------------------------
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
Simulating trial. . .
espilon = 0.9000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.59)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove right instead of left. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.20)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.08)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.44)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.63)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.52)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.60)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.44)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent properly idled at a red light. (rewarded 1.39)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.57)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.06)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.68)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.66)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded -0.43)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 3
\-------------------------
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
Simulating trial. . .
espilon = 0.8500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.45)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.58)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.20)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.42)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded -0.10)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.06)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.03)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.98)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.07)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.55)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.15)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.37)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.31)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent drove forward instead of right. (rewarded 0.05)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.40)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 4
\-------------------------
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
Simulating trial. . .
espilon = 0.8000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.58)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.73)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.23)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.20)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.92)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.59)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.07)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.78)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.90)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent drove right instead of forward. (rewarded 1.14)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove right instead of left. (rewarded -0.14)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.68)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', None)
Agent followed the waypoint right. (rewarded 0.86)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.38)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 5
\-------------------------
Simulating trial. . .
espilon = 0.7500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.42)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.03)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.15)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.92)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.20)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.90)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.47)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.69)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.74)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.15)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.16)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.68)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.18)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.92)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.73)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.45)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.40)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.03)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.27)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded -0.36)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 1.04)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.16)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.20)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 6
\-------------------------
Simulating trial. . .
espilon = 0.7000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.21)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.06)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.47)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.74)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 1.70)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.63)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.20)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.53)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.45)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.83)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.43)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 1.10)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.61)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent drove forward instead of right. (rewarded 0.60)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.53)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 0.58)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.39)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.79)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.79)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.27)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove forward instead of left. (rewarded -0.16)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent drove forward instead of left. (rewarded -0.69)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.97)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 7
\-------------------------
Simulating trial. . .
espilon = 0.6500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.12)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.19)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.98)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 8
\-------------------------
Simulating trial. . .
espilon = 0.6000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 1.09)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -21.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.40)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent drove right instead of forward. (rewarded 0.19)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.54)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove forward instead of left. (rewarded 0.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.43)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.71)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.94)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.62)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 1.44)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.19)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.13)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.91)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.21)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.66)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 0.84)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.25)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 0.72)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.11)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.35)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 0.99)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.89)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.85)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.15)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.91)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.05)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 9
\-------------------------
Simulating trial. . .
espilon = 0.5500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.40)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.95)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 1.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.03)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 0.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.99)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.23)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.12)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.75)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.31)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.54)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.73)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.43)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 0.96)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 10
\-------------------------
Simulating trial. . .
espilon = 0.5000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.69)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.74)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.89)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.10)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.10)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.71)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.11)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.70)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.27)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.39)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 11
\-------------------------
Simulating trial. . .
espilon = 0.4500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.42)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded 0.84)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.67)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.59)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.01)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 0.91)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded -0.14)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 1.82)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.16)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.78)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -10.48)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.01)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.67)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.03)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.76)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.56)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.73)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.83)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 12
\-------------------------
Simulating trial. . .
espilon = 0.4000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.63)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.82)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.77)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove right instead of left. (rewarded 0.68)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove forward instead of right. (rewarded 1.41)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.40)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.94)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.85)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', None)
Agent drove left instead of right. (rewarded 0.04)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.94)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.53)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.72)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.31)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.17)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.05)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.24)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.59)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.14)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.02)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.11)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.86)
3% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 13
\-------------------------
Simulating trial. . .
espilon = 0.3500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.54)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.38)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.71)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.39)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.07)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.18)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.51)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove forward instead of right. (rewarded 0.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded -0.15)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 1.35)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.46)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded -0.04)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.33)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.41)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.74)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.18)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.29)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.64)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 14
\-------------------------
Simulating trial. . .
espilon = 0.3000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.93)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.87)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.79)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.49)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.42)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.58)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.10)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.45)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.08)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.33)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.79)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 0.32)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent drove right instead of forward. (rewarded 0.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 15
\-------------------------
Simulating trial. . .
espilon = 0.2500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'forward')
Agent drove forward instead of right. (rewarded 1.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.82)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.39)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.64)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.62)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.13)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.03)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.08)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.30)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.13)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.42)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.19)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 16
\-------------------------
Simulating trial. . .
espilon = 0.2000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.86)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.61)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.04)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.27)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.96)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.89)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.36)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.19)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.97)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.07)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.51)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.09)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.79)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.81)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.61)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.24)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 1.54)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.88)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.17)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.59)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 17
\-------------------------
Simulating trial. . .
espilon = 0.1500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.63)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.21)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.77)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 0.45)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 2.81)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.92)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.88)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 18
\-------------------------
Simulating trial. . .
espilon = 0.1000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.91)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.48)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.22)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.72)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.68)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.53)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.99)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.61)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.31)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.17)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 19
\-------------------------
Simulating trial. . .
espilon = 0.0500; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.86)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.79)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.39)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.92)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.63)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.25)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 1.60)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.56)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.93)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.20)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.74)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 1.54)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'left')
Agent followed the waypoint forward. (rewarded 0.68)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.01)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded -0.10)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.96)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.46)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.79)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.92)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.44)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 20
\-------------------------
Simulating trial. . .
espilon = -0.0000; alpha = 0.5000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.40)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.14)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.21)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.21)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.67)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.59)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.85)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.82)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded 0.79)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.61)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.67)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.61)
33% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 1
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.04)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.63)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.57)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.07)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.76)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.00)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.53)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.36)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.51)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.61)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 1.10)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.04)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.51)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 1.63)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.31)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.56)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.96)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.88)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.46)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.21)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.51)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.50)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.40)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.96)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.07)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.33)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.27)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.70)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.34)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.08)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded -0.40)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.49)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 2
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.62)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded 1.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.61)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 1.73)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.12)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.89)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.74)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.09)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.59)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.71)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.14)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.83)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.23)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.67)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent followed the waypoint forward. (rewarded 1.33)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.24)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.74)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 3
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded 1.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 1.09)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 0.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.78)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.24)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.59)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.82)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.40)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded -0.22)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent drove forward instead of left. (rewarded 0.41)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.60)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.00)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 4
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent drove right instead of forward. (rewarded 0.66)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.68)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.63)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 0.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.41)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.31)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.91)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.78)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.40)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.45)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 5
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.09)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove forward instead of left. (rewarded 0.18)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.77)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded 1.75)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.44)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.90)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.64)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.24)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.26)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.81)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.63)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.89)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.31)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.19)
49% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 6
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.83)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.31)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.55)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.87)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.66)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.24)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent drove forward instead of right. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.00)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.28)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.61)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.29)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.64)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.58)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded -0.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 7
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.06)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.72)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 0.40)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.06)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 1.21)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.33)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.48)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.59)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.25)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.13)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent drove forward instead of left. (rewarded 0.22)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.09)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.40)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.34)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.51)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.96)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.06)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.26)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.57)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.27)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.49)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 8
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 1.07)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 0.81)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.02)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.75)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.63)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.94)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.26)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.10)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.69)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.27)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent drove forward instead of left. (rewarded 0.39)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.47)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.25)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -11.00)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.84)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 0.46)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.75)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded -0.42)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.72)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 9
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.69)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.70)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.08)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.80)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.00)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.73)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.31)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.88)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.08)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.21)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.96)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.95)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.10)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.60)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.31)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 10
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -10.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.81)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.87)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.51)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.60)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove forward instead of right. (rewarded 1.86)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.58)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.28)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.26)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.37)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.67)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.84)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.37)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.76)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
Simulation ended. . .
# Load the 'sim_default-learning' file from the default Q-Learning simulation
vs.plot_trials('sim_default-learning.csv')
Using the visualization above, produced from your initial Q-Learning simulation, provide an analysis and make observations about the driving agent as in Question 3. Note that the simulation should also have produced a Q-table in a text file, which can help you make observations about the agent's learning. Some additional things you should consider:
Answer:
1 - There are no similarities between the basic driving agent and the default Q-Learning driving agent. Bad actions, violations, and accidents decreased substantially over time for the Q-Learning agent, which was not the case for the basic driving agent. In addition, the Q-Learning agent's reliability rate rose from 10% to slightly above 60% and then fell back to 60%, which is a large improvement.
2 - It took 20 training trials before the Q-Learning agent began testing. This number makes sense, since ε is reduced by 0.05 on each training trial and the epsilon tolerance is 0.05. After 19 trials ε equals 0.05, which is not yet below the tolerance; after 20 trials, 1.0 - (20 x 0.05) = 0.0 < 0.05.
3 - Yes, the constant-rate decay function I implemented for ε (the exploration factor) appears to be accurately represented in the parameters panel.
4 - As the number of training trials increases, the number of bad actions decreases and the average reward increases.
5 - The safety and reliability ratings are both still F. Although the Q-Learning agent performs better than the basic driving agent, it has not improved enough to earn a better rating for either safety or reliability.
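The arithmetic in item 2 can be checked with a short sketch. It assumes a linear decay applied once per training trial and a test phase that starts only when ε is strictly below the tolerance; the function name `trials_until_testing` is hypothetical and is not part of the project code. Exact fractions are used so that the comparison at the 0.05 boundary is not affected by floating-point rounding.

```python
from fractions import Fraction


def trials_until_testing(epsilon, step, tolerance):
    """Count training trials until epsilon drops strictly below tolerance.

    Hypothetical helper: assumes epsilon decays linearly by `step`
    once per training trial.
    """
    trials = 0
    while epsilon >= tolerance:
        epsilon -= step  # linear decay, applied once per trial
        trials += 1
    return trials


# Values taken from the answer above: start at 1.0, step 0.05, tolerance 0.05.
n = trials_until_testing(Fraction(1), Fraction(1, 20), Fraction(1, 20))
print(n)
```

With a plain `float` the same loop may stop one trial earlier or later, depending on how the accumulated rounding error falls at the boundary.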
The fourth step in creating an optimized Q-Learning driving agent is to optimize it! Now that the Q-Learning algorithm is implemented and the driving agent is learning successfully, the learning settings and parameters need to be tuned so that the driving agent learns to navigate safely and efficiently. This step typically requires a lot of trial and error, since some settings will invariably make learning worse. One thing to keep in mind is the act of learning itself and the time it takes: in theory, we could allow the agent to learn for an incredibly long time; however, another goal of Q-Learning is to transition from experimenting with unlearned behavior to acting on learned behavior. For example, always allowing the agent to take random actions during training (with $\epsilon = 1$ and no decay) will certainly make it learn, but will never let it act. When improving your Q-Learning implementation, consider the implications of each adjustment and whether it makes sense logistically.
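The trade-off between experimenting and acting can be illustrated with a minimal epsilon-greedy sketch. This is not the project's `agent.py`; the function `choose_action` and the Q-values below are hypothetical and only show how ε controls the choice between a random action and the best-known one.

```python
import random


def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy selection over a dict mapping action -> Q-value.

    Hypothetical helper: with probability epsilon pick a random action
    (explore), otherwise pick an action with the highest Q-value (exploit).
    """
    actions = list(q_values)
    if rng.random() < epsilon:
        return rng.choice(actions)  # explore
    best = max(q_values.values())
    # Break ties at random so no action is systematically favored.
    return rng.choice([a for a in actions if q_values[a] == best])


# Hypothetical Q-values for a single state; None stands for idling.
q = {None: 0.0, 'forward': 1.5, 'left': -0.3, 'right': 0.2}

greedy = choose_action(q, epsilon=0.0)    # always exploits
explored = choose_action(q, epsilon=1.0)  # always explores
print(greedy, explored)
```

With `epsilon=1.0` and no decay the agent never uses what it has learned, which is the situation described above.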
To obtain results from the improved Q-Learning implementation, you will need to adjust the following flags:
- 'enforce_deadline' - Set this to True to force the driving agent to record whether it reaches the destination in time.
- 'update_delay' - Set this to a small value (such as 0.01) to reduce the time between steps in each trial.
- 'log_metrics' - Set this to True to log the simulation results as a .csv file and the Q-table as a .txt file in the /logs/ folder.
- 'learning' - Set this to 'True' to tell the driving agent to use your Q-Learning implementation.
- 'optimized' - Set this to 'True' to tell the driving agent that you are running an optimized version of the Q-Learning implementation.
Additional flags that can be adjusted as part of optimizing the Q-Learning driving agent:
- 'n_test' - Set this to a positive number (previously 10) to run that many testing trials.
- 'alpha' - Set this to a real number between 0 and 1 to adjust the learning rate of the Q-Learning algorithm.
- 'epsilon' - Set this to a real number between 0 and 1 to adjust the initial exploration factor of the Q-Learning algorithm.
- 'tolerance' - Set this to a small positive real number (the default was 0.05) to adjust the epsilon tolerance, the exploration-factor threshold below which testing begins.
In addition, use a decay function of your choice for $\epsilon$ (the exploration factor). Note that whichever function you use must decay below the value of 'tolerance' at a reasonable rate. The Q-Learning driving agent will not begin testing until this occurs, so if the decay is too slow, learning can take a long time. Some example decay functions (for $t$, the number of trials):
$$ \epsilon = a^t, \textrm{for } 0 < a < 1 \hspace{50px}\epsilon = \frac{1}{t^2}\hspace{50px}\epsilon = e^{-at}, \textrm{for } 0 < a < 1 \hspace{50px} \epsilon = \cos(at), \textrm{for } 0 < a < 1$$
You may also use a decay function for $\alpha$ (the learning rate) if you wish, although this is usually not done. If you decide to try it, make sure your decay function satisfies the inequality $0 \leq \alpha \leq 1$.
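To compare how quickly each example function falls below the tolerance, the sketch below evaluates them for increasing $t$ and reports the first trial at which $\epsilon$ is below the threshold. The function names and the values chosen for $a$ are assumptions made for illustration; they are not taken from the project code.

```python
import math


def eps_power(t, a=0.95):
    """epsilon = a^t, for 0 < a < 1."""
    return a ** t


def eps_inverse_square(t):
    """epsilon = 1 / t^2, defined for t >= 1."""
    return 1.0 / t ** 2


def eps_exponential(t, a=0.05):
    """epsilon = e^(-a t), for 0 < a < 1."""
    return math.exp(-a * t)


def eps_cosine(t, a=0.01):
    """epsilon = cos(a t), for 0 < a < 1 (decreasing while a*t <= pi)."""
    return math.cos(a * t)


def first_trial_below(decay, tolerance, max_trials=10000):
    """Return the first trial t >= 1 with decay(t) < tolerance, or None."""
    for t in range(1, max_trials + 1):
        if decay(t) < tolerance:
            return t
    return None


tolerance = 0.05  # the default tolerance mentioned above
for decay in (eps_power, eps_inverse_square, eps_exponential, eps_cosine):
    print(decay.__name__, first_trial_below(decay, tolerance))
```

The number of training trials differs widely between these functions for the same tolerance, so the choice of function and of $a$ determines how much exploration the agent gets before testing starts.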
If you have difficulty getting your implementation to work, try setting the 'verbose' flag to True to get more information for debugging. Flags that were changed should be returned to their initial values while you are debugging. It is important that you understand what each flag does and how it affects the simulation!
Once you have successfully completed the simulation with the Q-Learning algorithm, run the code cell below to visualize the results. Note that log files are overwritten when identical simulations are run, so be careful about which log file is being loaded!
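Before launching a long run, it can help to check the chosen values against the ranges stated above. The sketch below is only an illustration: the `settings` dictionary and `check_settings` are hypothetical and do not reflect how `agent.py` actually receives its flags; the 'alpha' value is an arbitrary example.

```python
def check_settings(s):
    """Return a list of problems found in a hypothetical settings dict."""
    problems = []
    if not 0.0 <= s['alpha'] <= 1.0:
        problems.append("alpha must be between 0 and 1")
    if not 0.0 <= s['epsilon'] <= 1.0:
        problems.append("epsilon must be between 0 and 1")
    if not s['tolerance'] > 0:
        problems.append("tolerance must be positive")
    if not (isinstance(s['n_test'], int) and s['n_test'] > 0):
        problems.append("n_test must be a positive integer")
    return problems


# Flag names come from the text above; 'alpha' is an assumed example value.
settings = {
    'enforce_deadline': True,
    'update_delay': 0.01,
    'log_metrics': True,
    'learning': True,
    'optimized': True,
    'n_test': 10,
    'alpha': 0.5,
    'epsilon': 1.0,
    'tolerance': 0.05,
}

print(check_settings(settings))
```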
%run ./smartcab/agent.py
Simulator.__init__(): Error initializing GUI objects; display disabled.
error: No available video device
/-------------------------
| Training trial 1
\-------------------------
Simulating trial. . .
espilon = 1.0000; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent drove left instead of right. (rewarded 0.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 1.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 1.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.92)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.79)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.98)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.63)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.38)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.84)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.10)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.42)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.25)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.35)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.06)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent drove right instead of left. (rewarded -0.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 2
\-------------------------
Simulating trial. . .
espilon = 0.9960; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.33)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.93)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.13)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.61)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.72)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.04)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 2.54)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.68)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.98)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.89)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 0.08)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.74)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.75)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.07)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.76)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.18)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.15)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.23)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.08)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.34)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove right instead of left. (rewarded 1.37)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.82)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.36)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.72)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.18)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.93)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.20)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.86)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 3
\-------------------------
Simulating trial. . .
espilon = 0.9920; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.64)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.86)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 2.55)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.78)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.77)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.87)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.01)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.67)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.56)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.20)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.39)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.33)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.53)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 2.49)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.16)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.25)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.66)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 0.87)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.29)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.49)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.14)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -10.29)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 4
\-------------------------
Simulating trial. . .
espilon = 0.9881; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.42)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.88)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 1.17)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.51)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.31)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.33)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.98)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.21)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 1.36)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.34)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.60)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.25)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.66)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.36)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.63)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.55)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.46)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.21)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 5
\-------------------------
Simulating trial. . .
espilon = 0.9841; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 1.19)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.23)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.86)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.58)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.32)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.82)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.97)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.03)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.26)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.78)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.73)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.20)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.18)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 6
\-------------------------
Simulating trial. . .
espilon = 0.9802; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.50)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.53)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.15)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.81)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.18)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.40)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.41)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.51)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.76)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.57)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.97)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.83)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.72)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.93)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 7
\-------------------------
Simulating trial. . .
espilon = 0.9763; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.46)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.97)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.34)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.94)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.90)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.41)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.76)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.32)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent drove right instead of left. (rewarded -0.06)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.13)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.74)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.69)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.01)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.72)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.97)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.63)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.82)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.47)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.78)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.07)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.89)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.74)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.33)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.98)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.60)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.10)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.68)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.91)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 8
\-------------------------
Simulating trial. . .
espilon = 0.9724; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.98)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'right')
Agent properly idled at a red light. (rewarded 2.66)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.90)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.57)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.42)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.30)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.00)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.53)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove left instead of right. (rewarded 0.64)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.84)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.87)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.32)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.31)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 9
\-------------------------
Simulating trial. . .
espilon = 0.9685; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.55)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.19)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.26)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.85)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.24)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.37)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.28)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 2.09)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.96)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.67)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.38)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.46)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.94)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.95)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.15)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.18)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.40)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'right')
Agent drove right instead of left. (rewarded 0.58)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.08)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 10
\-------------------------
Simulating trial. . .
espilon = 0.9646; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.62)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.88)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.39)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.41)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.88)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.26)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.75)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.45)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.17)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.24)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.39)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove left instead of right. (rewarded 1.24)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.77)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.90)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded -0.56)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 11
\-------------------------
Simulating trial. . .
espilon = 0.9608; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.73)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.49)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.17)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.54)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.15)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.42)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.38)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.52)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.64)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.46)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.30)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 12
\-------------------------
Simulating trial. . .
espilon = 0.9570; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.51)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.03)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.46)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.58)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.89)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.39)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.62)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.82)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.81)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.47)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.07)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.70)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.90)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.10)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.45)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.02)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded -0.23)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.35)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.13)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.35)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.78)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.02)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.56)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.74)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.97)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.07)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
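Each step block in the log ends with a line such as `(rewarded -9.74)`, so a trial can be summarized directly from the text. The following is a minimal sketch, not part of the project code: the function name `summarize_trial` and the dictionary keys are hypothetical, and the sample text is copied from steps 26 to 29 of the trial above.

```python
import re

# Matches the reward printed at the end of each step line, e.g. "(rewarded -9.74)".
REWARD_RE = re.compile(r"\(rewarded (-?\d+\.\d+)\)")


def summarize_trial(log_text):
    """Summarize one trial's log text (hypothetical helper, for illustration only)."""
    rewards = [float(value) for value in REWARD_RE.findall(log_text)]
    return {
        "steps": len(rewards),
        "total_reward": round(sum(rewards), 2),
        # Simple substring counts on the messages the simulator prints.
        "red_light_violations": log_text.count("through a red light"),
        "major_accidents": log_text.count("major accident"),
        "minor_accidents": log_text.count("minor accident"),
    }


sample = """\
Agent followed the waypoint left. (rewarded 0.56)
Agent attempted driving left through a red light. (rewarded -9.74)
Agent drove forward instead of left. (rewarded 0.97)
Agent drove forward instead of left. (rewarded 1.07)
"""

summary = summarize_trial(sample)
print(summary)
```

Counting substrings is a deliberately simple choice; it relies on the exact message wording seen in this log and would need adjusting if the simulator's messages differ.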
/-------------------------
| Training trial 13
\-------------------------
Simulating trial. . .
epsilon = 0.9531; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded 1.65)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 1.55)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.77)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.28)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 1.69)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded 0.91)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 0.84)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.72)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.07)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.30)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove left instead of forward. (rewarded 1.34)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 1.37)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.12)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.24)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.04)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.23)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
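The "% of time remaining" figures in the trial above are consistent with a fixed step budget per trial (25 steps here, since step 24 reports 0%), with the percentage computed after each step. The sketch below reproduces the logged values under that assumption; the function name `time_remaining_percent` and its parameters are hypothetical, and the simulator's actual formula is not shown in this log.

```python
def time_remaining_percent(step, total_steps):
    """Percentage of the step budget left after finishing `step` (0-based).

    Assumes the budget is `total_steps` and that the simulator rounds to
    the nearest whole percent; both are inferred from the log, not confirmed.
    """
    remaining = total_steps - (step + 1)
    return round(100.0 * remaining / total_steps)


# Steps 0 to 2 of a 25-step trial, as logged above: 96%, 92%, 88%.
first_steps = [time_remaining_percent(s, 25) for s in range(3)]
print(first_steps)

# The last step of the trial leaves no time, which is when the trial is aborted.
print(time_remaining_percent(24, 25))
```

The same function matches trials with a different budget, for example step 15 of a 30-step trial gives 47.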
/-------------------------
| Training trial 14
\-------------------------
Simulating trial. . .
epsilon = 0.9493; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.04)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.49)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.08)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.49)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.90)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.05)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 0.45)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.86)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.79)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.48)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.73)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.45)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.48)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.16)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded -0.02)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.27)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.39)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.45)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.37)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.11)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.69)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.98)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent drove right instead of forward. (rewarded 0.53)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.96)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.93)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.93)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.90)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 15
\-------------------------
Simulating trial. . .
epsilon = 0.9455; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 2.02)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.92)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.79)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.25)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.20)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.31)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.67)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.90)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.51)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.85)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.61)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.43)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.52)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.23)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.03)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.25)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.30)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.14)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 0.32)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 0.72)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 16
\-------------------------
Simulating trial. . .
epsilon = 0.9418; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded -0.00)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.53)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.00)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.81)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.90)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.43)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.87)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.98)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.05)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.48)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.90)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.12)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.82)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
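The exploration factor printed at the start of training trials 13 to 16 (0.9531, 0.9493, 0.9455, 0.9418) is consistent with an exponential decay of the form exp(-a * t), with a = 0.004 and t equal to the trial number minus one. The sketch below checks that reading; the function name `exploration_factor` is hypothetical, and the log does not confirm that the agent uses this formula or that the decay constant is tied to the logged alpha.

```python
import math


def exploration_factor(trial, decay=0.004):
    """Exploration factor for a 1-based training trial under exponential decay.

    Assumption: the factor is exp(-decay * (trial - 1)). The decay value 0.004
    was chosen because it reproduces the logged numbers.
    """
    return math.exp(-decay * (trial - 1))


logged = {13: 0.9531, 14: 0.9493, 15: 0.9455, 16: 0.9418}
computed = {trial: round(exploration_factor(trial), 4) for trial in logged}
print(computed)
```

With a factor still above 0.93, the agent picks a random action most of the time, which fits the frequent red-light violations and accidents in these early trials.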
/-------------------------
| Training trial 17
\-------------------------
Simulating trial. . .
epsilon = 0.9380; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.73)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.19)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.38)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.32)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.02)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove left instead of right. (rewarded 1.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.89)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.87)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.63)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.27)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.90)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.48)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 18
\-------------------------
Simulating trial. . .
espilon = 0.9343; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.44)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.08)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.54)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.23)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.55)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.42)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.55)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.21)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.64)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.67)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.56)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.89)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded -0.03)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.77)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.13)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.14)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.40)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.56)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.06)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.42)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 19
\-------------------------
Simulating trial. . .
espilon = 0.9305; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.61)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.78)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 0.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded -0.03)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.52)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.97)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.82)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.44)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.54)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.00)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.40)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.43)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.78)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.25)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.65)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.65)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.52)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 20
\-------------------------
Simulating trial. . .
espilon = 0.9268; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.14)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.18)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.85)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.34)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.57)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.03)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.69)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.97)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.02)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.70)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.36)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.54)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.98)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.05)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.15)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.57)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.83)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.10)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.63)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 21
\-------------------------
Simulating trial. . .
espilon = 0.9231; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.67)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 0.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.85)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent drove right instead of forward. (rewarded 1.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.54)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 1.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.83)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.99)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.95)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.45)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.89)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.74)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.16)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.32)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.16)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.81)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 22
\-------------------------
Simulating trial. . .
espilon = 0.9194; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.19)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 1.31)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.42)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.13)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.48)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 0.65)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.11)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.96)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.76)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.02)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.46)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.42)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.56)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded -0.13)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent followed the waypoint left. (rewarded 0.95)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.91)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.30)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.25)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.91)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.05)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.08)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.90)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 23
\-------------------------
Simulating trial. . .
espilon = 0.9158; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.22)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.63)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.02)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 0.25)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.47)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.64)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove right instead of left. (rewarded -0.12)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.03)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.87)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 0.09)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.42)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.33)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 1.35)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.15)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.52)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.83)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.20)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.42)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 24
\-------------------------
Simulating trial. . .
espilon = 0.9121; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.22)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.74)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent drove right instead of left. (rewarded -0.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.85)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.53)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.42)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.82)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.23)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 1.24)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.24)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.35)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.54)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded -0.78)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 25
\-------------------------
Simulating trial. . .
espilon = 0.9085; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.34)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.51)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.66)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.11)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.45)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.45)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.50)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.63)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.81)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.58)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.87)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.71)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.84)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 26
\-------------------------
Simulating trial. . .
espilon = 0.9048; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.55)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.81)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.77)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.93)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.62)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.75)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.67)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.59)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.76)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.57)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove forward instead of right. (rewarded 0.97)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.75)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.12)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 27
\-------------------------
Simulating trial. . .
espilon = 0.9012; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.98)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.70)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded -0.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.61)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 28
\-------------------------
Simulating trial. . .
espilon = 0.8976; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.63)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.74)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.51)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.91)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.89)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.67)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.40)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.63)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.40)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.00)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.20)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.06)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.06)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.43)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.89)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.15)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.13)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.17)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.25)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.99)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.47)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 29
\-------------------------
Simulating trial. . .
espilon = 0.8940; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.22)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove right instead of forward. (rewarded 1.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 0.33)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.00)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.82)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.20)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.14)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.98)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.66)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.80)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.52)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.52)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.60)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.29)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.03)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.13)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.22)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.19)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 0.92)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.42)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 30
\-------------------------
Simulating trial. . .
espilon = 0.8905; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.29)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.36)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent drove right instead of left. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.90)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove left instead of right. (rewarded -0.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.10)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.76)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.11)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.62)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.47)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.59)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.63)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 0.51)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.67)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 31
\-------------------------
Simulating trial. . .
espilon = 0.8869; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.32)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.19)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.66)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.48)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.55)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.18)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.11)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.68)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.56)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.59)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.16)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.07)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.20)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.19)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.01)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.18)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'left')
Agent followed the waypoint right. (rewarded 0.58)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 32
\-------------------------
Simulating trial. . .
espilon = 0.8834; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.01)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.42)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.98)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.66)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.54)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.73)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.86)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.40)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.26)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.93)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.45)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.17)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 1.04)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 0.60)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.16)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.13)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.76)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.13)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 33
\-------------------------
Simulating trial. . .
espilon = 0.8799; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 1.96)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.74)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.98)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.64)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.13)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.00)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.82)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.28)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.30)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.52)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.13)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.55)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.24)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.21)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.55)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.79)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.58)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.50)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.24)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.11)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.55)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 1.35)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.74)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.99)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.69)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.80)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.21)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.12)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.50)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.60)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.71)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 34
\-------------------------
Simulating trial. . .
espilon = 0.8763; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.23)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.29)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.10)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.67)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 1.73)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.63)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.70)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove left instead of right. (rewarded 0.65)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.74)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 1.18)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.41)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.98)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.37)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.49)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.39)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.55)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.32)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 35
\-------------------------
Simulating trial. . .
espilon = 0.8728; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.86)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.20)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.75)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.37)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.63)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.71)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.97)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.07)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.97)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.90)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.50)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.29)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.18)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.37)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.90)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.83)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.92)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.15)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.29)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.26)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.51)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.31)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.07)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded -0.48)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.84)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.29)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.27)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.62)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 36
\-------------------------
Simulating trial. . .
espilon = 0.8694; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.03)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.20)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.44)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.12)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.49)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.41)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.33)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.64)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.42)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.21)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.57)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.64)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.47)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.61)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.32)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.18)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.54)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.57)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.73)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.41)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.74)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.58)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.65)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 0.16)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.13)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.07)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.96)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 37
\-------------------------
Simulating trial. . .
espilon = 0.8659; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.24)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.26)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.01)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.94)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.33)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.89)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.76)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.04)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.32)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.06)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.89)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.48)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.96)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.61)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 38
\-------------------------
Simulating trial. . .
espilon = 0.8624; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.29)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.26)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 39
\-------------------------
Simulating trial. . .
espilon = 0.8590; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.84)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.44)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.76)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.52)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.94)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.52)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.94)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.00)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.03)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.24)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.71)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.92)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.92)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 40
\-------------------------
Simulating trial. . .
espilon = 0.8556; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.32)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.83)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.26)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.15)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.71)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.11)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.15)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.72)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.62)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.63)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.26)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.46)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.21)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.99)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.96)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.82)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.14)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.63)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.52)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.19)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.03)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.01)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.16)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.78)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.83)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.36)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 41
\-------------------------
Simulating trial. . .
espilon = 0.8521; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.47)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.68)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.79)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.49)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.66)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.54)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.05)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.24)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.02)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.49)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.94)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.02)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.13)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.97)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.44)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.08)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.24)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.41)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.64)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.51)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.25)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.82)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.29)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.69)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.62)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.14)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.45)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 42
\-------------------------
Simulating trial. . .
espilon = 0.8487; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 1.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.28)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.55)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.23)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.11)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.76)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.67)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.75)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.80)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.14)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.36)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.58)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.52)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.68)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded -0.21)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.58)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.13)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.99)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 43
\-------------------------
Simulating trial. . .
espilon = 0.8454; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.66)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.14)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.95)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.44)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.78)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.29)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.10)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.33)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 44
\-------------------------
Simulating trial. . .
espilon = 0.8420; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.57)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.85)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.65)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.41)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.99)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.69)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.97)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.54)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.90)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded -0.15)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.07)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.92)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.55)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 45
\-------------------------
Simulating trial. . .
espilon = 0.8386; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove right instead of forward. (rewarded 1.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.97)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.96)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'forward')
Agent drove right instead of left. (rewarded 0.23)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.96)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 1.38)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 0.67)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.65)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.81)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.41)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.15)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.58)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.63)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.56)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 46
\-------------------------
Simulating trial. . .
espilon = 0.8353; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.65)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 1.88)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.05)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.59)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.48)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.49)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.24)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove left instead of right. (rewarded 0.15)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.34)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.11)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.65)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.87)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.76)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.45)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.22)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 1.07)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent drove forward instead of left. (rewarded 1.01)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.00)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.43)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.25)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.55)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.29)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.47)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.82)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.79)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.81)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.69)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 47
\-------------------------
Simulating trial. . .
espilon = 0.8319; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.66)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.26)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.93)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.03)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.68)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.05)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.31)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.28)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.88)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.95)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.35)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.01)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 0.38)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove left instead of forward. (rewarded 0.18)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.53)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.41)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 48
\-------------------------
Simulating trial. . .
espilon = 0.8286; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.50)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.09)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.01)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.28)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.48)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.13)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.73)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.62)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.09)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.58)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.12)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.88)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.57)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 2.15)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 1.51)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.24)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.67)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.73)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.15)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.39)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.00)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.45)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.97)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.23)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.27)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 0.93)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.95)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.70)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 49
\-------------------------
Simulating trial. . .
espilon = 0.8253; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.18)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.13)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.88)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.91)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.34)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.28)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.48)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 50
\-------------------------
Simulating trial. . .
espilon = 0.8220; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.10)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.35)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.60)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.52)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.03)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.49)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.91)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.63)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 51
\-------------------------
Simulating trial. . .
espilon = 0.8187; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.87)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.58)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.98)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.23)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.50)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.26)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.21)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded -0.13)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.01)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove left instead of right. (rewarded 0.29)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove left instead of forward. (rewarded 0.25)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.10)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.11)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.18)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.07)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.19)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.14)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 52
\-------------------------
Simulating trial. . .
espilon = 0.8155; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.07)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded -0.06)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.61)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove forward instead of right. (rewarded 0.11)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.22)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.06)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.44)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 1.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.44)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded 1.14)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent followed the waypoint forward. (rewarded 0.62)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.86)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 53
\-------------------------
Simulating trial. . .
espilon = 0.8122; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 0.32)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent drove right instead of left. (rewarded 1.53)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.26)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.36)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.98)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.60)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.08)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.06)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.54)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.02)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.64)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 54
\-------------------------
Simulating trial. . .
espilon = 0.8090; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.15)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.19)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.27)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove forward instead of right. (rewarded 0.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent drove forward instead of right. (rewarded 0.44)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove left instead of forward. (rewarded 1.31)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.52)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.66)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.10)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.66)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.90)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.14)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.80)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.84)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.25)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.04)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 55
\-------------------------
Simulating trial. . .
espilon = 0.8057; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.42)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.64)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.37)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.40)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.26)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.03)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.28)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.02)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.49)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.42)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.71)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.13)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -9.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 0.95)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.57)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.31)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.94)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.42)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded -0.43)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.89)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 56
\-------------------------
Simulating trial. . .
espilon = 0.8025; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.99)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.96)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.19)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.69)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.73)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.51)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.09)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent drove right instead of forward. (rewarded 0.49)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.53)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 0.11)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.88)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.48)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.79)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 1.05)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.32)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.78)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.09)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.97)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 57
\-------------------------
Simulating trial. . .
espilon = 0.7993; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.40)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 1.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 0.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.70)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.39)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.07)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.71)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded -0.01)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.47)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent drove right instead of left. (rewarded 0.86)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.39)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 58
\-------------------------
Simulating trial. . .
espilon = 0.7961; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove forward instead of left. (rewarded 0.43)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.53)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.61)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.76)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.85)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 1.21)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.96)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.58)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.82)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.60)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.60)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.03)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded -0.18)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.34)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.95)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.04)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.64)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.41)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.87)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.82)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.71)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.54)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded -0.65)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.32)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.19)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.73)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 59
\-------------------------
Simulating trial. . .
espilon = 0.7929; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.72)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.51)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.91)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.02)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.76)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.46)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.01)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 0.89)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.71)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.14)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 0.17)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.03)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 60
\-------------------------
Simulating trial. . .
espilon = 0.7898; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.21)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent drove forward instead of right. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.91)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.62)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.06)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove forward instead of right. (rewarded 1.33)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.21)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.37)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.36)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.21)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.76)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 61
\-------------------------
Simulating trial. . .
espilon = 0.7866; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.84)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.76)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded -0.02)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.93)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.53)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.46)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent drove right instead of left. (rewarded 0.57)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.68)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.02)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.60)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.16)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.52)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.36)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.91)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.91)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.80)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.17)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.13)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.60)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove left instead of forward. (rewarded -0.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 62
\-------------------------
Simulating trial. . .
espilon = 0.7835; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.31)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.50)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.38)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.01)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.11)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.12)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 0.98)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.39)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.50)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.28)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.80)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.05)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.35)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.74)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.23)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.42)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.19)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.70)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.98)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.57)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.49)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.17)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.61)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.89)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.31)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.62)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.43)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.26)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove left instead of right. (rewarded 0.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 63
\-------------------------
Simulating trial. . .
espilon = 0.7804; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 0.61)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 1.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.59)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.29)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.90)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.24)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'right')
Agent followed the waypoint right. (rewarded 1.26)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.20)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.41)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.21)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.96)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 1.31)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.64)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.86)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 64
\-------------------------
Simulating trial. . .
espilon = 0.7772; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.54)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.29)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.79)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.25)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.07)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.79)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.60)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.09)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.22)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.63)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.44)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.08)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.85)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.18)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.00)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.15)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.53)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded -0.10)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 0.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 65
\-------------------------
Simulating trial. . .
espilon = 0.7741; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.48)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 0.96)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.89)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 0.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.42)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.07)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.31)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent followed the waypoint right. (rewarded 1.32)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.55)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.09)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 1.08)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.44)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.79)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove forward instead of right. (rewarded 1.11)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.34)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 66
\-------------------------
Simulating trial. . .
espilon = 0.7711; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'right')
Agent properly idled at a red light. (rewarded 1.50)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.93)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.98)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.20)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.40)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.00)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded -0.14)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.57)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.40)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.34)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.21)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.94)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.14)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.15)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.94)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.91)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 67
\-------------------------
Simulating trial. . .
espilon = 0.7680; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.10)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.63)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.98)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.84)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.41)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.27)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 0.40)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent drove right instead of forward. (rewarded 0.51)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.93)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.15)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.47)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.64)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded -0.33)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.20)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.17)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.44)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.27)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 0.62)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 68
\-------------------------
Simulating trial. . .
espilon = 0.7649; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.05)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.84)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.34)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.85)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove forward instead of left. (rewarded 1.45)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.86)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.53)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent followed the waypoint left. (rewarded 0.82)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.10)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.33)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.43)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.11)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.61)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.65)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.42)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded -0.44)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded -0.10)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.53)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.65)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.00)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 69
\-------------------------
Simulating trial. . .
espilon = 0.7619; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.23)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.20)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.24)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.98)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.60)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.01)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.22)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove left instead of right. (rewarded 0.21)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 1.12)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.07)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.69)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.02)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.76)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.14)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.51)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.43)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.75)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.61)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.48)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.81)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.15)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.91)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.52)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.84)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.77)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.42)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.82)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.67)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.99)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.90)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.61)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 70
\-------------------------
Simulating trial. . .
espilon = 0.7588; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.89)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.25)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.95)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.26)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.05)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.05)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.63)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.24)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove right instead of left. (rewarded 0.69)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.29)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.12)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.43)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.68)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.62)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.89)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.88)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.52)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.21)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.43)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded -0.18)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.65)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.28)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.30)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.91)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 71
\-------------------------
Simulating trial. . .
espilon = 0.7558; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.16)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.67)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.17)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.05)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.93)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.17)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.19)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.90)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.32)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.86)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.03)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.15)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.75)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.22)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.12)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.86)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.48)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.67)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.74)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.33)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.32)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.43)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.93)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.04)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.74)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.95)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 72
\-------------------------
Simulating trial. . .
espilon = 0.7528; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.68)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.92)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.97)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.75)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.66)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.54)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.62)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.47)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.42)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.80)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.93)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.08)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.94)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.51)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.12)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent drove forward instead of right. (rewarded 0.43)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.36)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.44)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.75)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.39)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.16)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.82)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.36)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.25)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove right instead of left. (rewarded 0.57)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.59)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.13)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 73
\-------------------------
Simulating trial. . .
espilon = 0.7498; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.77)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.25)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove forward instead of left. (rewarded 0.38)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.90)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.30)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.80)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.91)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.58)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.76)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.57)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.38)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -9.10)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.66)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.60)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.73)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.93)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.03)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.52)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.43)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.37)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.09)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.72)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.76)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.83)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.09)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.25)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.03)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.91)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.09)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.82)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 74
\-------------------------
Simulating trial. . .
espilon = 0.7468; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.34)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.09)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.35)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 0.58)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.64)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.42)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.26)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.32)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.72)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.53)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.90)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent properly idled at a red light. (rewarded 1.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.37)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.46)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.32)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 75
\-------------------------
Simulating trial. . .
espilon = 0.7438; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.91)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.79)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.83)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.49)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.93)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.28)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded -0.41)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.44)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.82)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.64)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.22)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.20)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 76
\-------------------------
Simulating trial. . .
espilon = 0.7408; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.22)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.21)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.20)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.92)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.32)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.05)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.78)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.22)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.08)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.84)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.96)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.73)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.59)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 77
\-------------------------
Simulating trial. . .
espilon = 0.7379; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.65)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 1.19)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.91)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded -0.09)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.69)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.57)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.28)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.52)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.69)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.14)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.91)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.36)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.79)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.68)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 78
\-------------------------
Simulating trial. . .
espilon = 0.7349; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.74)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.18)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -11.00)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent followed the waypoint right. (rewarded 1.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.81)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.36)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.44)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent drove right instead of left. (rewarded 0.17)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.21)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.60)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 79
\-------------------------
Simulating trial. . .
espilon = 0.7320; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.63)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.32)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.42)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 1.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.63)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.54)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.38)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.92)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.57)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.80)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.02)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.67)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.25)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.25)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 1.44)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.97)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.34)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.94)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.73)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 80
\-------------------------
Simulating trial. . .
espilon = 0.7291; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 1.99)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.77)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.01)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.69)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.76)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove right instead of left. (rewarded 0.76)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 0.79)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.06)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.07)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded -0.12)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.28)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.36)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.52)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.62)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.41)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.32)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.53)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 81
\-------------------------
Simulating trial. . .
espilon = 0.7261; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.77)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.93)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 0.14)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.06)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.96)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.53)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.23)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.40)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.66)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.82)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.29)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.18)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.22)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.83)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.69)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.38)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.56)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 82
\-------------------------
Simulating trial. . .
espilon = 0.7233; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.20)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.75)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 1.68)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.31)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.01)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.75)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded -0.24)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.44)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.51)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.15)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.51)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.37)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.14)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 83
\-------------------------
Simulating trial. . .
espilon = 0.7204; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.21)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.97)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.56)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.30)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.17)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 0.86)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.70)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.85)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.57)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.67)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.83)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.08)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove forward instead of right. (rewarded 1.15)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.26)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.23)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.51)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.92)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.21)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.45)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.90)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 84
\-------------------------
Simulating trial. . .
espilon = 0.7175; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.53)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.26)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.24)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.32)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.64)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.06)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.62)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.20)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.12)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.55)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.75)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.34)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.71)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 0.68)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.74)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.49)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 0.56)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.41)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.35)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.06)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.23)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent drove right instead of left. (rewarded 0.05)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.62)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.18)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.09)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.86)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.18)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.07)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.89)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.96)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove left instead of right. (rewarded 0.31)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.73)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 85
\-------------------------
Simulating trial. . .
epsilon = 0.7146; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.71)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.83)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.85)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.49)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.73)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.63)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.34)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove forward instead of left. (rewarded 0.44)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.70)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.86)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 86
\-------------------------
Simulating trial. . .
epsilon = 0.7118; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.10)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.05)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.32)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.76)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 1.62)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent drove left instead of right. (rewarded 1.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.15)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded -0.13)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.58)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.52)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.01)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.44)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.57)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 87
\-------------------------
Simulating trial. . .
epsilon = 0.7089; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.90)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.49)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 0.44)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded -0.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.16)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.79)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.90)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.08)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.49)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.61)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.92)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.37)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.28)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.32)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.43)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent drove right instead of forward. (rewarded 1.15)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove right instead of left. (rewarded -0.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.42)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 88
\-------------------------
Simulating trial. . .
epsilon = 0.7061; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.30)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.25)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.75)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.02)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.01)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.34)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.02)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.96)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.29)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.76)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.73)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent drove right instead of left. (rewarded 0.66)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.71)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.34)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.84)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.24)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.47)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.41)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.58)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.82)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 0.01)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.19)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.29)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.33)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.28)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.59)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.62)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.82)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.89)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 89
\-------------------------
Simulating trial. . .
epsilon = 0.7033; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.21)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.08)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.68)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.08)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.40)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.41)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.05)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.63)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.37)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.39)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.66)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.80)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.72)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.15)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove right instead of left. (rewarded 0.46)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.49)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.07)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.22)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.85)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.64)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.42)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.29)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.67)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.47)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.05)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.13)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.82)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.70)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 90
\-------------------------
Simulating trial. . .
espilon = 0.7005; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove right instead of left. (rewarded -0.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.38)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.20)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.34)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 1.24)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent followed the waypoint left. (rewarded 2.22)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.33)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.69)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.99)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.17)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded -0.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 91
\-------------------------
Simulating trial. . .
espilon = 0.6977; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 0.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent followed the waypoint forward. (rewarded 1.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent drove right instead of left. (rewarded 1.32)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.45)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 92
\-------------------------
Simulating trial. . .
espilon = 0.6949; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.86)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.76)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.80)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.46)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 1.69)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.38)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.06)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.32)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.69)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.10)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.26)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.03)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 93
\-------------------------
Simulating trial. . .
espilon = 0.6921; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 0.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.73)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 1.85)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.52)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.47)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded -0.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.49)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.92)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.25)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.76)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.85)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.81)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 94
\-------------------------
Simulating trial. . .
espilon = 0.6894; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.88)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.31)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.34)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.28)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.81)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.11)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.95)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.87)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.56)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.52)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.99)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.89)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.96)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.22)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.27)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.28)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.62)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.47)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.06)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 1.33)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.31)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.17)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.84)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.59)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.75)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.85)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.49)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove forward instead of right. (rewarded 0.67)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.82)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.88)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.61)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.23)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.58)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.29)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove right instead of forward. (rewarded -0.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 95
\-------------------------
Simulating trial. . .
espilon = 0.6866; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.31)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.34)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.01)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.92)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.55)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove right instead of left. (rewarded 1.24)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.51)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.24)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.65)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.94)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.60)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.05)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.68)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.99)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.79)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.97)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent drove right instead of forward. (rewarded -0.21)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.02)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.33)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.83)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.27)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.08)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.70)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.79)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.48)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'right')
Agent drove right instead of left. (rewarded -0.10)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.15)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 96
\-------------------------
Simulating trial. . .
espilon = 0.6839; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.47)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.66)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.48)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.25)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.26)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.35)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.93)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.99)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.98)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.23)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.51)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.81)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.36)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 0.83)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.19)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.13)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.14)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.36)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 97
\-------------------------
Simulating trial. . .
espilon = 0.6811; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.22)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.50)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.62)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.16)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.68)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.99)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.22)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent drove right instead of forward. (rewarded 1.02)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.24)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.40)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.64)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 98
\-------------------------
Simulating trial. . .
espilon = 0.6784; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -10.34)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.60)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.09)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 99
\-------------------------
Simulating trial. . .
espilon = 0.6757; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.97)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.94)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 1.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 1.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 1.70)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.53)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.23)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.22)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.34)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.82)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove right instead of forward. (rewarded 0.62)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded -0.22)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.71)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.13)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.54)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.47)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.73)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.56)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 100
\-------------------------
Simulating trial. . .
espilon = 0.6730; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 0.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.50)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.10)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.29)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.18)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.43)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.81)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.91)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.40)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.00)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.73)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 101
\-------------------------
Simulating trial. . .
espilon = 0.6703; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.70)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.31)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 1.81)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.66)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.46)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.78)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.94)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.25)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.81)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.04)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.41)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.62)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.85)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.25)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded 0.84)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.14)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.01)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.51)
4% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 102
\-------------------------
Simulating trial. . .
espilon = 0.6676; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.88)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.77)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.92)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.02)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.04)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.18)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.42)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.45)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.05)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.80)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent drove forward instead of left. (rewarded 0.37)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.21)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove forward instead of left. (rewarded -0.04)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.59)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.46)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.19)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.03)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.84)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 103
\-------------------------
Simulating trial. . .
espilon = 0.6650; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.29)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.93)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 1.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.90)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.45)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 1.17)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent drove right instead of left. (rewarded 1.37)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.44)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 0.81)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.96)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.69)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.96)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded -0.49)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.72)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.93)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.33)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.54)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 104
\-------------------------
Simulating trial. . .
espilon = 0.6623; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.98)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.20)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.24)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.00)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.41)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.07)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.91)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.89)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.32)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.02)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.07)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.51)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.17)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 1.34)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.86)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.86)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.79)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.20)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.52)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.22)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.95)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.91)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.47)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.62)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.06)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 0.42)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.17)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 0.94)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.39)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded -0.48)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded -0.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 105
\-------------------------
Simulating trial. . .
espilon = 0.6597; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.32)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.29)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.05)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.83)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.87)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.65)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.34)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.96)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.45)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.66)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.23)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded -0.17)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.51)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 106
\-------------------------
Simulating trial. . .
espilon = 0.6570; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.33)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.83)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.62)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.25)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.53)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.93)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.79)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.86)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded -0.24)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.43)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.70)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.30)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.06)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.91)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.31)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove forward instead of right. (rewarded 0.37)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.70)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.35)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.59)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 107
\-------------------------
Simulating trial. . .
espilon = 0.6544; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.38)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.79)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.31)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.19)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.34)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.21)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.93)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 1.38)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.39)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.13)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.72)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.20)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.60)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.23)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 108
\-------------------------
Simulating trial. . .
espilon = 0.6518; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.52)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.49)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.88)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.98)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.00)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.00)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.23)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.93)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.98)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.50)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 0.84)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.51)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 109
\-------------------------
Simulating trial. . .
epsilon = 0.6492; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.34)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.45)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.17)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent drove left instead of forward. (rewarded 1.03)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.28)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.99)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.80)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 110
\-------------------------
Simulating trial. . .
epsilon = 0.6466; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.51)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.09)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove left instead of right. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.38)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.28)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.93)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.29)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 1.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.48)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.16)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 1.47)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.49)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.77)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.76)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.19)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove left instead of forward. (rewarded 0.22)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.16)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.37)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.33)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 111
\-------------------------
Simulating trial. . .
epsilon = 0.6440; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.36)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 1.55)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.66)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.10)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.13)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.82)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.46)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.47)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.45)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.88)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded -0.21)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.00)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.38)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.46)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.68)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent drove forward instead of right. (rewarded -0.82)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 112
\-------------------------
Simulating trial. . .
epsilon = 0.6415; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.31)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.79)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.27)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.67)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.70)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.42)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 1.79)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.28)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 2.73)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.25)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.71)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 0.72)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.24)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.75)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.30)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.46)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.95)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.30)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.19)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.25)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.09)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 1.26)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.84)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.47)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.34)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.51)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.88)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.24)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.21)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 113
\-------------------------
Simulating trial. . .
epsilon = 0.6389; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 1.20)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.13)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.81)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.91)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.72)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.03)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.55)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.09)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.27)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.38)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.44)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.05)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.53)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.01)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.27)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.67)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.78)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.29)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.39)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.98)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.00)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.70)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.63)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 114
\-------------------------
Simulating trial. . .
epsilon = 0.6364; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -19.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.25)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.31)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.21)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.49)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.40)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.83)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.81)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.60)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.95)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.25)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 115
\-------------------------
Simulating trial. . .
epsilon = 0.6338; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'right')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.85)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.51)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.09)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.47)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.27)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.41)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded -0.38)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.16)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.61)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.87)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove left instead of right. (rewarded -0.74)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.49)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 116
\-------------------------
Simulating trial. . .
epsilon = 0.6313; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.43)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded -0.10)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.52)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.44)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.44)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.40)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.59)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.49)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.51)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 0.26)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.69)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 117
\-------------------------
Simulating trial. . .
epsilon = 0.6288; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and caused a minor accident. (rewarded -19.11)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.04)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.93)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.03)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.40)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.79)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.13)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.19)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.31)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.13)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.33)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.81)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.63)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.03)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.61)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.07)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.05)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.95)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.32)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.88)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.03)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 0.16)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.22)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.12)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.29)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove left instead of right. (rewarded -0.17)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.79)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 118
\-------------------------
Simulating trial. . .
epsilon = 0.6263; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove forward instead of right. (rewarded 0.99)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.61)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 1.59)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.63)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 1.16)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.51)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded -0.29)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.16)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.61)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.23)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.35)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded -0.38)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 119
\-------------------------
Simulating trial. . .
epsilon = 0.6238; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.96)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.52)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.11)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 1.48)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.64)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent followed the waypoint right. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.67)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.56)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.49)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.55)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 120
\-------------------------
Simulating trial. . .
epsilon = 0.6213; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.85)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent drove right instead of left. (rewarded 1.68)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.73)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.68)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.13)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.87)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.24)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.06)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.55)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.15)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove right instead of left. (rewarded -0.23)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 121
\-------------------------
Simulating trial. . .
epsilon = 0.6188; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.86)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.77)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.71)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.81)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.33)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.12)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.19)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.60)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.95)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.49)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.78)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -19.79)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.24)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.52)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.55)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.03)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.61)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.73)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.39)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 122
\-------------------------
Simulating trial. . .
epsilon = 0.6163; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.17)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.36)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.14)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -20.23)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 0.45)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.36)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.86)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.54)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.96)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -19.05)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.73)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.63)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 123
\-------------------------
Simulating trial. . .
epsilon = 0.6139; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.89)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.32)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.08)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 124
\-------------------------
Simulating trial. . .
epsilon = 0.6114; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.93)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.57)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.19)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.70)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.42)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.99)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.94)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove left instead of right. (rewarded 0.56)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.16)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.74)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.67)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.08)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 0.62)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded 1.02)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 125
\-------------------------
Simulating trial. . .
espilon = 0.6090; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.28)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.30)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove left instead of forward. (rewarded 0.03)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.24)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.77)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.32)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.18)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.27)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 126
\-------------------------
Simulating trial. . .
espilon = 0.6065; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.28)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.25)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.55)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.94)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.31)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.72)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.08)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.13)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.34)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.14)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.27)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.93)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.35)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.17)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.91)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.04)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 127
\-------------------------
Simulating trial. . .
espilon = 0.6041; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.06)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.48)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.77)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.56)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.67)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.25)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.37)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded -0.04)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.29)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.72)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.23)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.29)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.19)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.19)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.75)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.44)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.68)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.82)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.55)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 128
\-------------------------
Simulating trial. . .
espilon = 0.6017; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.97)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove forward instead of left. (rewarded 0.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.31)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.01)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.14)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.62)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.19)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.72)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.15)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.28)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.46)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.23)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.08)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded -0.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 129
\-------------------------
Simulating trial. . .
espilon = 0.5993; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.31)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.43)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.83)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.54)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.02)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove right instead of left. (rewarded 0.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 1.28)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.56)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.52)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.68)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.45)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.23)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.81)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 130
\-------------------------
Simulating trial. . .
espilon = 0.5969; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.71)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 0.65)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.77)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.26)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.71)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.94)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.19)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.13)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.13)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.60)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.75)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.58)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.71)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.68)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded -0.55)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.21)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 131
\-------------------------
Simulating trial. . .
espilon = 0.5945; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'forward')
Agent drove forward instead of right. (rewarded 1.58)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.38)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.43)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.24)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.77)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.13)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded -0.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.90)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.46)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.48)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.79)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.04)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.07)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.94)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.21)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.82)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 132
\-------------------------
Simulating trial. . .
espilon = 0.5921; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.30)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.81)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.88)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.69)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.92)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.70)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.13)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 1.52)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.08)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.08)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.84)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.82)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -9.35)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.73)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.11)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.31)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.47)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.28)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.83)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 133
\-------------------------
Simulating trial. . .
espilon = 0.5898; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.61)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.26)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.74)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.72)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent drove left instead of forward. (rewarded 1.38)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.44)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.77)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.30)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.06)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 0.90)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 134
\-------------------------
Simulating trial. . .
espilon = 0.5874; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.44)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.09)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.32)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 2.22)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove right instead of forward. (rewarded 0.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.95)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.91)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.36)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.57)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 0.46)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.18)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.56)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.00)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.34)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.98)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.55)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.57)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.00)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.29)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 135
\-------------------------
Simulating trial. . .
espilon = 0.5851; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.18)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.59)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent drove right instead of forward. (rewarded 0.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.33)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.71)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.35)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.30)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.76)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.71)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.13)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.77)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.05)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.68)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 0.87)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.45)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 136
\-------------------------
Simulating trial. . .
espilon = 0.5827; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.07)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.33)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.97)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.37)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.31)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.48)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent drove right instead of forward. (rewarded -0.24)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.81)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.02)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.50)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 137
\-------------------------
Simulating trial. . .
espilon = 0.5804; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.71)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 1.35)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.54)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent drove left instead of forward. (rewarded 1.17)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 0.14)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.30)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.57)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.48)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove left instead of right. (rewarded 0.05)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.55)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.02)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.87)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.72)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.32)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.88)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.60)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.75)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.53)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.13)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.16)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.13)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.07)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove right instead of left. (rewarded -0.58)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 0.76)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.10)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.54)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 138
\-------------------------
Simulating trial. . .
espilon = 0.5781; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.83)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.94)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.88)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 0.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.42)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove left instead of right. (rewarded 0.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.95)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.55)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.75)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.34)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.84)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.47)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.86)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.21)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.08)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded -0.16)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded -0.00)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.45)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 139
\-------------------------
Simulating trial. . .
espilon = 0.5758; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.70)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.58)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.52)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.42)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.32)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 140
\-------------------------
Simulating trial. . .
espilon = 0.5735; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.92)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.77)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.57)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.65)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove left instead of right. (rewarded -0.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.11)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.50)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.61)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.76)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.43)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove left instead of right. (rewarded 0.63)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.28)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.46)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.75)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.58)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 141
\-------------------------
Simulating trial. . .
espilon = 0.5712; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent drove forward instead of right. (rewarded 1.49)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.63)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.60)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.04)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.47)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.20)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.44)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.60)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.69)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove right instead of forward. (rewarded -0.50)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.67)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 142
\-------------------------
Simulating trial. . .
espilon = 0.5689; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 1.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.49)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.05)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.19)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.83)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.43)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 0.92)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.19)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.33)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.18)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.15)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.43)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.09)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 143
\-------------------------
Simulating trial. . .
espilon = 0.5667; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 0.01)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.25)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.45)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.30)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.91)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.22)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.15)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.05)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 0.33)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.31)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.57)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.93)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 144
\-------------------------
Simulating trial. . .
espilon = 0.5644; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.38)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.60)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.25)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.84)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.38)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.34)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.07)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove right instead of forward. (rewarded 1.30)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.13)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.19)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 145
\-------------------------
Simulating trial. . .
espilon = 0.5621; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 0.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.52)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.32)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.30)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.21)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.65)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.66)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 1.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.78)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.46)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.04)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.99)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.10)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.70)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.90)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.42)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.59)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.95)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.60)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 146
\-------------------------
Simulating trial. . .
espilon = 0.5599; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.95)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.93)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.40)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.43)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.66)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.43)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
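
The per-step lines in this log follow a regular pattern: each outcome line ends with `(rewarded X)`. A minimal sketch of how one trial could be summarized from that text follows. The function name `summarize_trial` and the summary fields are hypothetical and not part of the project code; the sample lines are copied from Training trial 146 above.

```python
import re

# Matches the "(rewarded 1.75)" / "(rewarded -9.77)" suffix printed for every step.
REWARD_RE = re.compile(r"\(rewarded (-?\d+\.\d+)\)")


def summarize_trial(lines):
    """Collect per-step rewards from the log lines of a single trial.

    Hypothetical helper for reading this log; not part of the Smartcab code.
    """
    rewards = []
    for line in lines:
        match = REWARD_RE.search(line)
        if match:
            rewards.append(float(match.group(1)))
    return {
        "steps": len(rewards),
        "total_reward": round(sum(rewards), 2),
        # Steps with a negative reward (violations, accidents, wrong turns).
        "negative_steps": sum(1 for r in rewards if r < 0),
    }


sample = [
    "Agent previous state: ('left', 'green', None, None, None)",
    "Agent followed the waypoint left. (rewarded 1.75)",
    "Agent previous state: ('left', 'red', None, None, 'left')",
    "Agent attempted driving forward through a red light. (rewarded -9.77)",
    "Agent properly idled at a red light. (rewarded 2.66)",
]
summary = summarize_trial(sample)
print(summary)
```

A negative reward alone does not distinguish a traffic violation from a merely suboptimal move, so the outcome text would still need to be inspected to grade Safety.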
/-------------------------
| Training trial 147
\-------------------------
Simulating trial. . .
espilon = 0.5577; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.06)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.06)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.77)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.24)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.09)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.32)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.55)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.84)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.92)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.58)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 148
\-------------------------
Simulating trial. . .
espilon = 0.5554; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.25)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.74)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove forward instead of left. (rewarded 0.01)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.42)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.96)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.96)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.80)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.21)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.99)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.46)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded 0.63)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.10)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.14)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.73)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.67)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.39)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.83)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 149
\-------------------------
Simulating trial. . .
espilon = 0.5532; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.36)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.55)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.96)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.83)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.46)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.37)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.12)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.08)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.60)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.21)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.39)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.93)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.55)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 0.06)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent drove forward instead of left. (rewarded 0.06)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.36)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.57)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.01)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.46)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'left')
Agent drove right instead of forward. (rewarded 0.13)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.41)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.51)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.72)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 150
\-------------------------
Simulating trial. . .
espilon = 0.5510; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.37)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.77)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.65)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.24)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.76)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.33)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.64)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.80)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.35)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.12)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 151
\-------------------------
Simulating trial. . .
espilon = 0.5488; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.93)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.03)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.48)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.95)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.64)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.31)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.38)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.65)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.78)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.06)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.34)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
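
The exploration factor printed at the start of each trial (logged as `espilon`) shrinks slightly from trial to trial while `alpha` stays at 0.0040. The sketch below checks one decay schedule that happens to reproduce the logged values, assuming an exponential decay of the form exp(-alpha * (trial - 1)). This schedule is an inference from the numbers in this log, not something the log states, and `epsilon_for_trial` is a hypothetical name.

```python
import math

ALPHA = 0.004  # matches the logged "alpha = 0.0040"


def epsilon_for_trial(trial, alpha=ALPHA):
    """Assumed schedule: epsilon = exp(-alpha * (trial - 1)).

    Inferred from the logged values only; the actual agent.py may differ.
    """
    return math.exp(-alpha * (trial - 1))


# Values copied from the trial headers in this log.
logged = {
    146: 0.5599,
    147: 0.5577,
    148: 0.5554,
    149: 0.5532,
    150: 0.5510,
    151: 0.5488,
    152: 0.5466,
}

for trial, value in logged.items():
    print(trial, f"{epsilon_for_trial(trial):.4f}", value)
```

With epsilon still above 0.5 at this point, more than half of the actions are chosen at random under an epsilon-greedy policy, which is in line with the frequent violations seen in these trials.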
/-------------------------
| Training trial 152
\-------------------------
Simulating trial. . .
espilon = 0.5466; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.70)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.29)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.95)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.44)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.63)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.98)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.56)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.14)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.93)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.68)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.94)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.73)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.99)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove forward instead of left. (rewarded 1.17)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.61)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.43)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.09)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.53)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.96)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.91)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.43)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent drove right instead of left. (rewarded 0.56)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.67)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded -0.32)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 153
\-------------------------
Simulating trial. . .
espilon = 0.5444; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.34)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.94)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 0.04)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.35)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.29)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.92)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.61)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.87)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.92)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.49)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.92)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.05)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.52)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.91)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.36)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.86)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.29)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.11)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 154
\-------------------------
Simulating trial. . .
espilon = 0.5423; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove left instead of forward. (rewarded 1.50)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.17)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.07)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.29)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.29)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.87)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 0.73)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.56)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.12)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.93)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 1.11)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.01)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove forward instead of left. (rewarded 0.20)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.34)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 155
\-------------------------
Simulating trial. . .
espilon = 0.5401; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.04)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 0.81)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.58)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.64)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.43)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.13)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.36)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.17)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.19)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.36)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.91)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.12)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.55)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.45)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.50)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.98)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.62)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.66)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove forward instead of left. (rewarded 1.51)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.10)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.12)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.23)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.81)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.22)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded -0.55)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove forward instead of right. (rewarded -0.77)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.95)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 156
\-------------------------
Simulating trial. . .
espilon = 0.5379; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.05)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.91)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.66)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.27)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.56)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.68)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.57)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 0.83)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.39)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove forward instead of left. (rewarded -0.41)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.09)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.79)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 157
\-------------------------
Simulating trial. . .
espilon = 0.5358; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent drove forward instead of right. (rewarded 0.35)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.49)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.98)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 1.19)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.43)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 0.91)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.76)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.14)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.87)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.54)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.16)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 158
\-------------------------
Simulating trial. . .
espilon = 0.5337; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -9.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.95)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.95)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.61)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.40)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.71)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.00)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.57)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 0.37)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.82)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 159
\-------------------------
Simulating trial. . .
espilon = 0.5315; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove left instead of forward. (rewarded 1.44)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove forward instead of right. (rewarded 1.68)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.69)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.53)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.57)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.89)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.36)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.04)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove left instead of right. (rewarded 0.69)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.09)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove forward instead of right. (rewarded 0.59)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.23)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.39)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.81)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.07)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.15)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.37)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.58)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.17)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 160
\-------------------------
Simulating trial. . .
espilon = 0.5294; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.98)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.45)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.68)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded -0.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.14)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 161
\-------------------------
Simulating trial. . .
espilon = 0.5273; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.87)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.42)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.21)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.02)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.88)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.30)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.91)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.42)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.31)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.45)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.30)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.06)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 162
\-------------------------
Simulating trial. . .
espilon = 0.5252; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.83)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.90)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.20)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.53)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.34)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.33)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.54)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.94)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.93)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.67)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.68)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.29)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.22)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.01)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.82)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.09)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.54)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.18)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove right instead of left. (rewarded 0.11)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.91)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.09)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.74)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.32)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.32)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.42)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove right instead of left. (rewarded -0.37)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove forward instead of right. (rewarded 0.65)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 163
\-------------------------
Simulating trial. . .
epsilon = 0.5231; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.23)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.42)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.81)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.86)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 164
\-------------------------
Simulating trial. . .
epsilon = 0.5210; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.22)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.66)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.13)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.46)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.51)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.65)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.01)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.31)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.89)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 165
\-------------------------
Simulating trial. . .
epsilon = 0.5189; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.55)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.21)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.50)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.36)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.57)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.67)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.02)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.84)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.37)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.24)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.08)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove left instead of right. (rewarded 0.25)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.50)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.33)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.54)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.96)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.57)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.87)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.09)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.71)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.20)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 166
\-------------------------
Simulating trial. . .
epsilon = 0.5169; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.66)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.59)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.37)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded -0.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.66)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.89)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 0.91)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.88)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 167
\-------------------------
Simulating trial. . .
epsilon = 0.5148; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.34)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.18)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.72)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded -0.11)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.89)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.15)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.23)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.59)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 168
\-------------------------
Simulating trial. . .
epsilon = 0.5127; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.03)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.00)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.84)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.97)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.63)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.45)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.71)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 1.66)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.14)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.23)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.19)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.92)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.31)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove forward instead of right. (rewarded 0.25)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.38)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.43)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.48)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 1.21)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.24)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.48)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.16)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 169
\-------------------------
Simulating trial. . .
espilon = 0.5107; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 1.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.32)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 1.23)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.80)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.67)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent drove right instead of forward. (rewarded -0.03)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.69)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.02)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.91)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.19)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.03)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.43)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded -0.52)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent drove forward instead of right. (rewarded 0.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 170
\-------------------------
Simulating trial. . .
espilon = 0.5086; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.15)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.53)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.86)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.69)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.58)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.65)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.68)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.40)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.92)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.11)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.09)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.07)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.45)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.95)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.87)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.37)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 171
\-------------------------
Simulating trial. . .
espilon = 0.5066; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent drove forward instead of right. (rewarded 0.91)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.42)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.49)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.81)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.18)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 0.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.69)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent followed the waypoint right. (rewarded 2.29)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 172
\-------------------------
Simulating trial. . .
espilon = 0.5046; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.16)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.73)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.00)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.12)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.23)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 0.46)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'forward')
Agent drove right instead of left. (rewarded 0.48)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.58)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.67)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.98)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.65)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.04)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.77)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.10)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.79)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.01)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.48)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.45)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.89)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.64)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.46)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.47)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.78)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.31)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.06)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.57)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.93)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.75)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.14)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.61)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.69)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 173
\-------------------------
Simulating trial. . .
espilon = 0.5026; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.26)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.58)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.37)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.03)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.26)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.48)
77% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 174
\-------------------------
Simulating trial. . .
espilon = 0.5006; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove left instead of forward. (rewarded 1.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.06)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.19)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.35)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.17)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.16)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.36)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 1.69)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 175
\-------------------------
Simulating trial. . .
espilon = 0.4986; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.95)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.36)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.23)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.15)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.84)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.58)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.58)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.13)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.42)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.79)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.75)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.62)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.47)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 176
\-------------------------
Simulating trial. . .
espilon = 0.4966; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent drove right instead of left. (rewarded 1.11)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.32)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 1.07)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.65)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.98)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.29)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.81)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.73)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.58)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.17)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.14)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.72)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.28)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.51)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.56)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.88)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.14)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.97)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.61)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 177
\-------------------------
Simulating trial. . .
espilon = 0.4946; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.47)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.07)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.06)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.05)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.14)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.14)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.71)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.78)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.59)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.24)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 178
\-------------------------
Simulating trial. . .
espilon = 0.4926; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.38)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.58)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.44)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.28)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent drove right instead of left. (rewarded 0.58)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.23)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.69)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.16)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.32)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.54)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.36)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 0.86)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.45)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.40)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.33)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.83)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.35)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.73)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.23)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 179
\-------------------------
Simulating trial. . .
espilon = 0.4907; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.66)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.63)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.61)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.93)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 180
\-------------------------
Simulating trial. . .
espilon = 0.4887; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove left instead of forward. (rewarded 0.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 0.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.16)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.38)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.75)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.05)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.47)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.54)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.86)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded -0.26)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.34)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.97)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.72)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 181
\-------------------------
Simulating trial. . .
espilon = 0.4868; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.87)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.50)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.48)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.88)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.75)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.93)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 1.05)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.21)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.87)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.76)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.58)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.19)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.63)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.27)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.15)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.46)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.35)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 182
\-------------------------
Simulating trial. . .
espilon = 0.4848; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.25)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.22)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.59)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.49)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.12)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.82)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.96)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.82)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 183
\-------------------------
Simulating trial. . .
espilon = 0.4829; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.30)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.97)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.18)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.51)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove left instead of right. (rewarded 0.64)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.40)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 184
\-------------------------
Simulating trial. . .
espilon = 0.4809; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.54)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.34)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.45)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.67)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.82)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.22)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.11)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.65)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 185
\-------------------------
Simulating trial. . .
espilon = 0.4790; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 1.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 0.90)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded -0.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent drove forward instead of left. (rewarded -0.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.22)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.86)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.74)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.34)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.67)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.36)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.69)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.63)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 186
\-------------------------
Simulating trial. . .
epsilon = 0.4771; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 2.88)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 0.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.41)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.14)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.38)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.14)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 0.81)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.84)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.82)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.75)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.85)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 187
\-------------------------
Simulating trial. . .
epsilon = 0.4752; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.26)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.28)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.65)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.70)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.77)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.10)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 1.41)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.35)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.83)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.25)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded -0.16)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.77)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.08)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 188
\-------------------------
Simulating trial. . .
epsilon = 0.4733; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.92)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.71)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.64)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.47)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.28)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.33)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.47)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.52)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.08)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.10)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.56)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.22)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.52)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.90)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.55)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.16)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.55)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.61)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 189
\-------------------------
Simulating trial. . .
epsilon = 0.4714; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.77)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 1.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.21)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.92)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and caused a minor accident. (rewarded -19.05)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.58)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.55)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.46)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.33)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.93)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 190
\-------------------------
Simulating trial. . .
epsilon = 0.4695; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.81)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.18)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.02)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.39)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.81)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.21)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -19.58)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 191
\-------------------------
Simulating trial. . .
epsilon = 0.4677; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.44)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.91)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.35)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.55)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.10)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.23)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.70)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.95)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 0.82)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.12)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.20)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.68)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent drove forward instead of right. (rewarded -0.37)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.70)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.04)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.98)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.27)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded -0.29)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 192
\-------------------------
Simulating trial. . .
espilon = 0.4658; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.29)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 0.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 1.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.70)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.36)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.16)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.22)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.70)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.26)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.86)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.05)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.24)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 193
\-------------------------
Simulating trial. . .
espilon = 0.4639; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.07)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove forward instead of left. (rewarded 1.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.80)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.52)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.78)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded 0.84)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.25)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.98)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.70)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.65)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.21)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.07)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.36)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.87)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.66)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.65)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 194
\-------------------------
Simulating trial. . .
espilon = 0.4621; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 1.30)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.36)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.89)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.77)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'right')
Agent properly idled at a red light. (rewarded 0.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.21)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.91)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.00)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.94)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.29)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 0.96)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove right instead of forward. (rewarded -0.74)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 195
\-------------------------
Simulating trial. . .
espilon = 0.4602; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.49)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.44)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.77)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.80)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.18)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.20)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.40)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.59)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.52)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.23)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.02)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.87)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.20)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.85)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'left')
Agent drove forward instead of right. (rewarded 1.48)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.32)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.68)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove forward instead of right. (rewarded 0.73)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.73)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.61)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.31)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.61)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 0.54)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.84)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.83)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.26)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 196
\-------------------------
Simulating trial. . .
espilon = 0.4584; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.07)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.38)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.98)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.78)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.05)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.41)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.62)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.29)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.10)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.77)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.06)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.52)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 197
\-------------------------
Simulating trial. . .
espilon = 0.4566; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.82)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.46)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.92)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.94)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.11)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.82)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.52)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.87)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.63)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.31)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.91)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent drove right instead of forward. (rewarded -0.37)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.41)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent drove right instead of left. (rewarded -0.26)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.22)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 1.19)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.64)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 198
\-------------------------
Simulating trial. . .
espilon = 0.4548; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 1.36)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.05)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.05)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.98)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.06)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.52)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.54)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.78)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.90)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.93)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 199
\-------------------------
Simulating trial. . .
espilon = 0.4529; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.65)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 1.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'left')
Agent followed the waypoint left. (rewarded 2.31)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 200
\-------------------------
Simulating trial. . .
espilon = 0.4511; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.13)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.58)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.53)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.82)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.16)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.47)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -11.00)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.93)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.56)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.81)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.68)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 201
\-------------------------
Simulating trial. . .
espilon = 0.4493; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.05)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.87)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.40)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.32)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.96)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.87)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.77)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.56)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.80)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.23)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.67)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.14)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 202
\-------------------------
Simulating trial. . .
espilon = 0.4475; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.82)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.44)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.14)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.14)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.13)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.37)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.89)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.21)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.42)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.76)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.69)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 203
\-------------------------
Simulating trial. . .
espilon = 0.4457; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.91)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 0.17)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.93)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.61)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.93)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.05)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.69)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.27)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.66)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 204
\-------------------------
Simulating trial. . .
espilon = 0.4440; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.28)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.38)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.32)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.38)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.41)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.73)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.56)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.94)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.16)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.25)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.40)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.61)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.78)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.26)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.30)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.92)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 0.60)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.02)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.72)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.63)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.15)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.17)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 205
\-------------------------
Simulating trial. . .
espilon = 0.4422; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.41)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent drove left instead of forward. (rewarded 0.79)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.67)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.61)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.06)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.04)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.38)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 206
\-------------------------
Simulating trial. . .
espilon = 0.4404; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove forward instead of left. (rewarded 0.84)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.49)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.67)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.62)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.55)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.72)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.13)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 207
\-------------------------
Simulating trial. . .
espilon = 0.4387; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of right. (rewarded 0.05)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.54)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.33)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.65)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.63)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.55)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.12)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.79)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.13)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.28)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.57)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.16)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 208
\-------------------------
Simulating trial. . .
espilon = 0.4369; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.48)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.55)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.91)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.62)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.61)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.15)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.30)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.24)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.74)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.83)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 0.68)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.95)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.50)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.46)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.66)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 0.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 209
\-------------------------
Simulating trial. . .
espilon = 0.4352; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.57)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent drove right instead of forward. (rewarded 1.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.34)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.77)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.13)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.45)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.84)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.13)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.55)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.56)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.89)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.45)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove left instead of forward. (rewarded 0.49)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.79)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.87)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.88)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.84)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 210
\-------------------------
Simulating trial. . .
espilon = 0.4334; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.30)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -9.29)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.75)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.70)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.73)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.14)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.00)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.57)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.30)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 1.03)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.35)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.73)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.85)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 211
\-------------------------
Simulating trial. . .
espilon = 0.4317; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.09)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.35)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.82)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.20)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.88)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.68)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.26)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.33)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.33)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.99)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.76)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.55)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 212
\-------------------------
Simulating trial. . .
espilon = 0.4300; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.03)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.68)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.74)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.14)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.26)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.30)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.32)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.51)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.76)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.56)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 213
\-------------------------
Simulating trial. . .
espilon = 0.4283; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.25)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.70)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.60)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.39)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.34)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.75)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.90)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.90)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 214
\-------------------------
Simulating trial. . .
espilon = 0.4266; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.69)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.28)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.76)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.20)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.36)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 215
\-------------------------
Simulating trial. . .
espilon = 0.4249; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.83)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.33)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.29)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.43)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 1.67)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.09)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.25)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.71)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.84)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.46)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.85)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 216
\-------------------------
Simulating trial. . .
espilon = 0.4232; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.17)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.39)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.01)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 217
\-------------------------
Simulating trial. . .
espilon = 0.4215; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.78)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.26)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.53)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.32)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded -0.02)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.54)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded -0.09)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.11)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.09)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.09)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 218
\-------------------------
Simulating trial. . .
espilon = 0.4198; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.07)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.01)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.72)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.10)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.19)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.77)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.57)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.70)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 219
\-------------------------
Simulating trial. . .
espilon = 0.4181; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove forward instead of right. (rewarded 1.69)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.90)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.00)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.17)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.85)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.27)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.45)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.10)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.62)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.30)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.61)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.36)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.53)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.26)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.97)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.38)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.13)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.95)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.27)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 1.16)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.79)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.77)
23% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 220
\-------------------------
Simulating trial. . .
espilon = 0.4164; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.31)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 221
\-------------------------
Simulating trial. . .
espilon = 0.4148; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.21)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.06)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.11)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.75)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.24)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.85)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.64)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.54)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.61)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.82)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.42)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.65)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.26)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.09)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.67)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 0.88)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 222
\-------------------------
Simulating trial. . .
espilon = 0.4131; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.21)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.20)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.29)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.24)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.66)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.05)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.39)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.30)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.63)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.72)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.66)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.64)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.92)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.10)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 223
\-------------------------
Simulating trial. . .
espilon = 0.4115; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'right')
Agent drove right instead of left. (rewarded 0.16)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.21)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.86)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.23)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.53)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.40)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.04)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 224
\-------------------------
Simulating trial. . .
espilon = 0.4098; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.42)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.65)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.95)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.10)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.38)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.23)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.30)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.21)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.35)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 0.13)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.36)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 225
\-------------------------
Simulating trial. . .
espilon = 0.4082; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.72)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.32)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.57)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.60)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 1.18)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.35)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.67)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 0.59)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.59)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'right')
Agent drove forward instead of right. (rewarded 1.35)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.69)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.08)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.52)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 226
\-------------------------
Simulating trial. . .
espilon = 0.4066; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.37)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.55)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.83)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.44)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.57)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.18)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.21)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.93)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.87)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.18)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.79)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 227
\-------------------------
Simulating trial. . .
espilon = 0.4049; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.41)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.75)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.05)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.11)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.64)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.13)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.33)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.30)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.18)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.91)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.80)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 228
\-------------------------
Simulating trial. . .
espilon = 0.4033; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.71)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded -0.08)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 0.84)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.54)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.66)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.39)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.66)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.51)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.34)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 229
\-------------------------
Simulating trial. . .
espilon = 0.4017; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'right')
Agent drove right instead of left. (rewarded 0.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.16)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.70)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.68)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.32)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.43)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.70)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 230
\-------------------------
Simulating trial. . .
espilon = 0.4001; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.14)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.02)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'forward')
Agent drove forward instead of left. (rewarded 1.64)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.97)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.72)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.33)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.28)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.81)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.63)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.93)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.44)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.98)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.12)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.99)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.04)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 231
\-------------------------
Simulating trial. . .
espilon = 0.3985; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.81)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.30)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.69)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.68)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.02)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.43)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.27)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.47)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.55)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.86)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.53)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.86)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.21)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.25)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent drove right instead of left. (rewarded 1.20)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded -0.27)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.27)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.58)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.65)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.12)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.49)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.96)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.05)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.34)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.45)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 232
\-------------------------
Simulating trial. . .
espilon = 0.3969; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.45)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.54)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 0.30)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.35)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.15)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 0.94)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.75)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.15)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.58)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.02)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.28)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded -0.66)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 233
\-------------------------
Simulating trial. . .
espilon = 0.3953; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.95)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.83)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.68)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.96)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 2.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.47)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.73)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.99)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.20)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.91)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.01)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.81)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.86)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.21)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.60)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.44)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.32)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 234
\-------------------------
Simulating trial. . .
epsilon = 0.3938; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.40)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.01)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.72)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.27)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.17)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.25)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.19)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.62)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 235
\-------------------------
Simulating trial. . .
epsilon = 0.3922; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.68)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.69)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.14)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.68)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.58)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.36)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.49)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 1.34)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.15)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 236
\-------------------------
Simulating trial. . .
epsilon = 0.3906; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.91)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.81)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 0.85)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.53)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.37)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.24)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.62)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.56)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.61)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.65)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.97)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 237
\-------------------------
Simulating trial. . .
epsilon = 0.3891; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.88)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.48)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.32)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.07)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.45)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.61)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.48)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded -0.28)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 0.64)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.29)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 0.42)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.33)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 238
\-------------------------
Simulating trial. . .
epsilon = 0.3875; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.30)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.79)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.90)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.96)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.74)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.31)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.10)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.22)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 239
\-------------------------
Simulating trial. . .
epsilon = 0.3860; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.58)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.64)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent followed the waypoint right. (rewarded 1.26)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -10.15)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.96)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.01)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.76)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.25)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.40)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 0.19)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.27)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 0.32)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 240
\-------------------------
Simulating trial. . .
espilon = 0.3844; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.31)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.91)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.79)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.61)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.95)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent drove right instead of left. (rewarded 0.88)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.77)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.89)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 241
\-------------------------
Simulating trial. . .
espilon = 0.3829; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.15)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.93)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.18)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.90)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.34)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.24)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.28)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.19)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.97)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.49)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.29)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 242
\-------------------------
Simulating trial. . .
espilon = 0.3814; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'forward')
Agent drove forward instead of left. (rewarded 1.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 1.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.77)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.46)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.04)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.83)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.31)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.87)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 243
\-------------------------
Simulating trial. . .
espilon = 0.3798; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent followed the waypoint right. (rewarded 1.08)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 2.47)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.73)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.11)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 244
\-------------------------
Simulating trial. . .
espilon = 0.3783; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.63)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.81)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.96)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.30)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.06)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.94)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.89)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.97)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.09)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 0.95)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -10.37)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.01)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.64)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.16)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.94)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.15)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.94)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.68)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.70)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.92)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 245
\-------------------------
Simulating trial. . .
espilon = 0.3768; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove right instead of forward. (rewarded 1.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.97)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.70)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.53)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.07)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 0.81)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.95)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.81)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'right')
Agent drove forward instead of right. (rewarded 1.32)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.57)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.64)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.96)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.79)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.60)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.06)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 246
\-------------------------
Simulating trial. . .
espilon = 0.3753; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.04)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.63)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.44)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.02)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.89)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.23)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.64)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.39)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.46)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.47)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.61)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 247
\-------------------------
Simulating trial. . .
espilon = 0.3738; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.22)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.76)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 0.49)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.30)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.48)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.30)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.62)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.83)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.36)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.08)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent followed the waypoint left. (rewarded 1.91)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.64)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 248
\-------------------------
Simulating trial. . .
espilon = 0.3723; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.36)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.48)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.97)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 0.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.82)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.48)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.14)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.77)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.04)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.70)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded -0.04)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.45)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.21)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.88)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.85)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.36)
16% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 249
\-------------------------
Simulating trial. . .
espilon = 0.3708; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.84)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.80)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.33)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove forward instead of left. (rewarded -0.09)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.84)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.07)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.49)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.82)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 2.22)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 0.95)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.94)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent followed the waypoint forward. (rewarded 0.23)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 250
\-------------------------
Simulating trial. . .
espilon = 0.3694; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.49)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.38)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.09)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.67)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.64)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.79)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.18)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.36)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 251
\-------------------------
Simulating trial. . .
espilon = 0.3679; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.38)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.49)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.63)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.76)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.79)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.51)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.66)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.79)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.45)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.36)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent drove forward instead of right. (rewarded 0.58)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.19)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.67)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.10)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.47)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.56)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.93)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.10)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.57)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 252
\-------------------------
Simulating trial. . .
espilon = 0.3664; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.25)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.89)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 0.28)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove right instead of left. (rewarded 0.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 1.29)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.96)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.55)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 0.60)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.20)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.38)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.56)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.49)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.48)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 253
\-------------------------
Simulating trial. . .
espilon = 0.3649; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.77)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.33)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.01)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.87)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.80)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.10)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 254
\-------------------------
Simulating trial. . .
espilon = 0.3635; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.62)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.11)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.85)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.29)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.42)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.58)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.80)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.08)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.65)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.68)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.17)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.76)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.49)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.10)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.59)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.57)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.72)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.74)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 255
\-------------------------
Simulating trial. . .
espilon = 0.3620; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.42)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.79)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.35)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.47)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.42)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.50)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove left instead of forward. (rewarded 0.28)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.53)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.03)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.50)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.59)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.13)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.44)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.38)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.15)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 256
\-------------------------
Simulating trial. . .
espilon = 0.3606; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.97)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.21)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.27)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 257
\-------------------------
Simulating trial. . .
espilon = 0.3592; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.16)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.67)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.45)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.48)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.05)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.80)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.46)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.89)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 258
\-------------------------
Simulating trial. . .
espilon = 0.3577; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.98)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and caused a minor accident. (rewarded -20.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.68)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.56)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 1.59)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.20)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.22)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.51)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.49)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove forward instead of left. (rewarded 0.33)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 259
\-------------------------
Simulating trial. . .
epsilon = 0.3563; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.54)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.53)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.56)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent drove left instead of right. (rewarded 0.60)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.79)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent drove forward instead of right. (rewarded 0.74)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.91)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.78)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.74)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.45)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.16)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.19)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.86)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 260
\-------------------------
Simulating trial. . .
epsilon = 0.3549; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.47)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.51)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove right instead of forward. (rewarded 1.66)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.01)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.22)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.16)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.99)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.10)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.09)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.58)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.12)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.99)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.70)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'right')
Agent drove right instead of left. (rewarded 1.35)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.05)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.57)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.82)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.58)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.20)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.80)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.31)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.89)
17% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 261
\-------------------------
Simulating trial. . .
epsilon = 0.3535; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.09)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.19)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 262
\-------------------------
Simulating trial. . .
epsilon = 0.3520; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.52)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.90)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.23)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.36)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.13)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.43)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.87)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.78)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.47)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.09)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.14)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 263
\-------------------------
Simulating trial. . .
epsilon = 0.3506; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.24)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.15)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.39)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 0.95)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.88)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.50)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.00)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.26)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.82)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -20.28)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.14)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.45)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.89)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.92)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.15)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.58)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.59)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.77)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.25)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.81)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.25)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.61)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.88)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.85)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.12)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 264
\-------------------------
Simulating trial. . .
espilon = 0.3492; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 0.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.84)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.92)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.06)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.49)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.85)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 265
\-------------------------
Simulating trial. . .
espilon = 0.3478; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove forward instead of left. (rewarded 1.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.39)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.47)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 0.99)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent drove right instead of forward. (rewarded -0.12)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.33)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.68)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded -0.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.47)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.62)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.44)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded -0.38)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.77)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 266
\-------------------------
Simulating trial. . .
espilon = 0.3465; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.76)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.30)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.95)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.22)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.24)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.95)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.35)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 267
\-------------------------
Simulating trial. . .
espilon = 0.3451; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.82)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 1.73)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.90)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.28)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded -0.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove right instead of left. (rewarded 1.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.66)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.70)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.32)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.30)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.14)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.30)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.58)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.85)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.86)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.88)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 268
\-------------------------
Simulating trial. . .
espilon = 0.3437; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent drove right instead of forward. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent followed the waypoint left. (rewarded 2.37)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.91)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 269
\-------------------------
Simulating trial. . .
espilon = 0.3423; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.63)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.86)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.99)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.16)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.67)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.80)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.17)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.13)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.01)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.22)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.79)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.78)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.50)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.76)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.14)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.10)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 1.07)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.48)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.74)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.82)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.23)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.07)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 0.69)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.20)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.46)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 270
\-------------------------
Simulating trial. . .
espilon = 0.3410; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent drove forward instead of right. (rewarded 0.84)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.63)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.67)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.15)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.83)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.17)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.96)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.93)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.27)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.39)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.45)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.08)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.06)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.38)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.47)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.01)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.70)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.12)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.75)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 271
\-------------------------
Simulating trial. . .
espilon = 0.3396; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.71)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.92)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.86)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.80)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.77)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.87)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.93)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.99)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.22)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.24)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.85)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.62)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.87)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.01)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.38)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.83)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded -0.19)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.54)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded -0.56)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 0.92)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 272
\-------------------------
Simulating trial. . .
espilon = 0.3382; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.99)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 1.86)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.98)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.57)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.89)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.84)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.09)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.61)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.11)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.15)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.23)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.22)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.12)
16% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 273
\-------------------------
Simulating trial. . .
espilon = 0.3369; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.55)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.86)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.77)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.53)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.28)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.24)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.49)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.53)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.11)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.29)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.02)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.74)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.80)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.95)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.81)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.26)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.32)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.93)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.66)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.19)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.33)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.24)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.93)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.43)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.20)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.88)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.70)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 274
\-------------------------
Simulating trial. . .
espilon = 0.3355; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.29)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove left instead of forward. (rewarded 0.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.29)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.70)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.37)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.58)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.91)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.41)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.31)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.08)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 275
\-------------------------
Simulating trial. . .
espilon = 0.3342; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.43)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.70)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.11)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.58)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.22)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.14)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 276
\-------------------------
Simulating trial. . .
espilon = 0.3329; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.26)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.61)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent drove right instead of left. (rewarded 1.32)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.19)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.51)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.84)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.49)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 277
\-------------------------
Simulating trial. . .
espilon = 0.3315; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.72)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 1.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.50)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.96)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 1.66)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 0.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.99)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.22)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.17)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.73)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.74)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.96)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.28)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 278
\-------------------------
Simulating trial. . .
espilon = 0.3302; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.34)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.31)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.77)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.43)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.69)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 279
\-------------------------
Simulating trial. . .
espilon = 0.3289; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.18)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.08)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.93)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.97)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.00)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.44)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 0.97)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.38)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.10)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 280
\-------------------------
Simulating trial. . .
espilon = 0.3276; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.89)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.81)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.26)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.64)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.50)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.38)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.31)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.65)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.83)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.59)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 281
\-------------------------
Simulating trial. . .
espilon = 0.3263; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.11)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.84)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent drove right instead of left. (rewarded 0.04)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.50)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.26)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.62)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.79)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.89)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.25)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.95)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.36)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.32)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.44)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.48)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.84)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.14)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent drove left instead of right. (rewarded 0.53)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove left instead of right. (rewarded 0.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 282
\-------------------------
Simulating trial. . .
epsilon = 0.3250; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.25)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent followed the waypoint right. (rewarded 2.47)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.85)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.65)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.61)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent drove right instead of left. (rewarded 0.71)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.50)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 1.25)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.92)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.72)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.28)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.87)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.41)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.95)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded -0.02)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 283
\-------------------------
Simulating trial. . .
epsilon = 0.3237; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.21)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.00)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.22)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent drove forward instead of left. (rewarded 1.68)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.35)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.41)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.83)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.45)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.54)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
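
The per-step lines above all end in a `(rewarded X)` field, so a trial can be summarized mechanically. The following is only a minimal sketch, not part of the simulator: `SAMPLE_LOG`, `REWARD_RE`, and `summarize_rewards` are hypothetical names introduced here for illustration, and the sample holds three step-result lines copied from steps 5 to 7 of training trial 283.

```python
import re

# Three step-result lines copied from training trial 283 above.
SAMPLE_LOG = """\
Agent properly idled at a red light. (rewarded 2.54)
Agent attempted driving left through a red light. (rewarded -10.22)
Agent drove forward instead of left. (rewarded 1.68)
"""

# Matches the numeric reward inside "(rewarded X)"; assumes two-decimal output as in the log.
REWARD_RE = re.compile(r"\(rewarded (-?\d+\.\d+)\)")


def summarize_rewards(log_text):
    """Return (step count, total reward, number of negative rewards) for a log excerpt."""
    rewards = [float(value) for value in REWARD_RE.findall(log_text)]
    negatives = sum(1 for reward in rewards if reward < 0)
    return len(rewards), sum(rewards), negatives


steps, total, negatives = summarize_rewards(SAMPLE_LOG)
print("steps={} total={:.2f} negative={}".format(steps, total, negatives))
```

Feeding a whole trial's text to `summarize_rewards` would give the net reward for that trial. A single red-light violation (about -10) outweighs several correctly followed waypoints, which is why the negative count is reported separately.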
/-------------------------
| Training trial 284
\-------------------------
Simulating trial. . .
epsilon = 0.3224; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.03)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.40)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.89)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.78)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.29)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.28)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.33)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded -0.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.68)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 285
\-------------------------
Simulating trial. . .
epsilon = 0.3211; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.95)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove right instead of left. (rewarded 0.63)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.53)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.38)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.05)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.31)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.31)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.53)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.20)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.75)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 286
\-------------------------
Simulating trial. . .
epsilon = 0.3198; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.60)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.26)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.66)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.81)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.07)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.47)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.73)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.39)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.04)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.37)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.23)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.52)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.65)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
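
The exploration factor printed at the start of each trial shrinks by a nearly constant ratio from one trial to the next (0.3250, 0.3237, 0.3224, 0.3211, 0.3198 for training trials 282 to 286). The sketch below checks one schedule that would produce such values: an exponential decay exp(-0.004 * (trial - 1)). This is an inference from the printed numbers only. The log does not state the formula, and the names `LOGGED`, `DECAY`, and `epsilon_guess` are hypothetical. That the assumed decay constant equals the logged alpha of 0.0040 may or may not be intentional in agent.py.

```python
import math

# (trial number, exploration factor exactly as printed) for training trials 282-286.
LOGGED = [(282, "0.3250"), (283, "0.3237"), (284, "0.3224"),
          (285, "0.3211"), (286, "0.3198")]

DECAY = 0.004  # assumed constant; numerically equal to the logged alpha


def epsilon_guess(trial, decay=DECAY):
    """Hypothetical schedule: exp(-decay * (trial - 1))."""
    return math.exp(-decay * (trial - 1))


# Compare at the log's four-decimal precision.
matches = ["{:.4f}".format(epsilon_guess(trial)) == logged
           for trial, logged in LOGGED]
print(all(matches))
```

If the guess holds, the agent is still choosing a random action in roughly one step out of three at this stage of training, which fits the frequent red-light violations seen in these trials.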
/-------------------------
| Training trial 287
\-------------------------
Simulating trial. . .
epsilon = 0.3185; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.22)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.95)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.24)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.69)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove forward instead of right. (rewarded 1.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent drove right instead of left. (rewarded -0.11)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove forward instead of right. (rewarded 1.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.74)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.72)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 288
\-------------------------
Simulating trial. . .
epsilon = 0.3173; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.78)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove left instead of right. (rewarded 0.38)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.60)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.57)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.25)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.25)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.79)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.68)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent followed the waypoint left. (rewarded 2.57)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.88)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.45)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.86)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.35)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 289
\-------------------------
Simulating trial. . .
epsilon = 0.3160; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.10)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.66)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent drove right instead of forward. (rewarded 0.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.35)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.68)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.58)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.00)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.94)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.80)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 290
\-------------------------
Simulating trial. . .
epsilon = 0.3147; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.89)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.88)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.28)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.40)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.65)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.38)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.09)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.16)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 291
\-------------------------
Simulating trial. . .
epsilon = 0.3135; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.95)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.60)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.36)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.70)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.74)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.67)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.22)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.43)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.44)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.43)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.33)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.67)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.19)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.06)
51% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 292
\-------------------------
Simulating trial. . .
epsilon = 0.3122; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.98)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.46)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.01)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.30)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.87)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.28)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.27)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent drove right instead of left. (rewarded 0.73)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.18)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.14)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.89)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.52)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.29)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.65)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.44)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.72)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.59)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.06)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.25)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.89)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 293
\-------------------------
Simulating trial. . .
epsilon = 0.3110; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.90)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 0.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 0.86)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.36)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.21)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.23)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.14)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 294
\-------------------------
Simulating trial. . .
espilon = 0.3097; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent drove forward instead of right. (rewarded 0.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.30)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.51)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.59)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 1.72)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.00)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent drove right instead of left. (rewarded 1.59)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.58)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.86)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.05)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.62)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.48)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.77)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.76)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.86)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 295
\-------------------------
Simulating trial. . .
espilon = 0.3085; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.35)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.77)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.54)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.50)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent drove right instead of forward. (rewarded 0.74)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.32)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.81)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.65)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.22)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.77)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.51)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.28)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 0.78)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.82)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded -0.26)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.03)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.64)
8% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 296
\-------------------------
Simulating trial. . .
espilon = 0.3073; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.80)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.69)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.35)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.60)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.96)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 297
\-------------------------
Simulating trial. . .
espilon = 0.3061; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.43)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.60)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 1.26)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.33)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.11)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded -0.07)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.83)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.30)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded -0.08)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.95)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.48)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.26)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.20)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -9.94)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.28)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.79)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.80)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.31)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.31)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.70)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.75)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.36)
7% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 298
\-------------------------
Simulating trial. . .
espilon = 0.3048; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.59)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.05)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.77)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove forward instead of left. (rewarded 0.97)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.31)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.35)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.07)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.11)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.09)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 0.62)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.37)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.57)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 299
\-------------------------
Simulating trial. . .
epsilon = 0.3036; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.52)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.51)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 300
\-------------------------
Simulating trial. . .
epsilon = 0.3024; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.40)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.77)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.85)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.59)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.68)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.83)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.51)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 301
\-------------------------
Simulating trial. . .
epsilon = 0.3012; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.89)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.78)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.75)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.03)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.10)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.92)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 302
\-------------------------
Simulating trial. . .
epsilon = 0.3000; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.03)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.99)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.11)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.69)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.06)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.82)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.90)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.09)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.68)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.19)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.41)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.78)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.10)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.57)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.77)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.97)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.07)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.58)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.02)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 303
\-------------------------
Simulating trial. . .
epsilon = 0.2988; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.03)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.89)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.93)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 0.52)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.39)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 0.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.43)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.57)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.42)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 304
\-------------------------
Simulating trial. . .
epsilon = 0.2976; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.42)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.89)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.60)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.14)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.67)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.35)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.74)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.10)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 305
\-------------------------
Simulating trial. . .
epsilon = 0.2964; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.03)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.94)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.18)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.20)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.68)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.89)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.15)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.58)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.98)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.73)
69% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 306
\-------------------------
Simulating trial. . .
epsilon = 0.2952; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.90)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.38)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.01)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded -0.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.54)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.48)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.65)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 307
\-------------------------
Simulating trial. . .
epsilon = 0.2941; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.79)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.35)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.53)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.87)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.77)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.96)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.89)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.63)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.94)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.66)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.74)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.74)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 308
\-------------------------
Simulating trial. . .
espilon = 0.2929; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.02)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.50)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 0.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.97)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 309
\-------------------------
Simulating trial. . .
espilon = 0.2917; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.36)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.56)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.99)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.38)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 1.14)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.99)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.72)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.05)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded -0.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.82)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded -0.32)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.33)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.91)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.88)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.53)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.68)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 310
\-------------------------
Simulating trial. . .
espilon = 0.2905; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.59)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.44)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.25)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.30)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 311
\-------------------------
Simulating trial. . .
espilon = 0.2894; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.75)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.17)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.75)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.49)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.74)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.28)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.00)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.18)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 312
\-------------------------
Simulating trial. . .
espilon = 0.2882; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.04)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.18)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.70)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.23)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.69)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.03)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.96)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove forward instead of left. (rewarded 1.43)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.81)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 313
\-------------------------
Simulating trial. . .
espilon = 0.2871; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.82)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent drove right instead of forward. (rewarded 0.71)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.85)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.98)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.01)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.57)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.33)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 314
\-------------------------
Simulating trial. . .
espilon = 0.2859; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.22)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.63)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.39)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.21)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.67)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.23)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 315
\-------------------------
Simulating trial. . .
espilon = 0.2848; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent drove forward instead of right. (rewarded 0.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.24)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.10)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.38)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.66)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.10)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.16)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.07)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 316
\-------------------------
Simulating trial. . .
espilon = 0.2837; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove left instead of forward. (rewarded 0.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.89)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.74)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 317
\-------------------------
Simulating trial. . .
espilon = 0.2825; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.94)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.64)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.45)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.06)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.45)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 318
\-------------------------
Simulating trial. . .
espilon = 0.2814; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.72)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.37)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.26)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.09)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 1.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.98)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.78)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 319
\-------------------------
Simulating trial. . .
espilon = 0.2803; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.40)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.26)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.78)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 320
\-------------------------
Simulating trial. . .
espilon = 0.2792; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.59)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.95)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.61)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.76)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.64)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.51)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded -0.29)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.26)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.61)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.30)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.76)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 321
\-------------------------
Simulating trial. . .
espilon = 0.2780; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 1.98)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.40)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.09)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.28)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.72)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.80)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.62)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.71)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.50)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.59)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.05)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.44)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.11)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.66)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.50)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.51)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.85)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.04)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.99)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.49)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.55)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded -0.23)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.97)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 0.78)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.09)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 322
\-------------------------
Simulating trial. . .
espilon = 0.2769; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.56)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.67)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.27)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.53)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.19)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 0.91)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.09)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.61)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.94)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.54)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.87)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.99)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.86)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 323
\-------------------------
Simulating trial. . .
espilon = 0.2758; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.97)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.24)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.38)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.69)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.17)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.54)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.79)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.00)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.79)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.89)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.44)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.70)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 324
\-------------------------
Simulating trial. . .
espilon = 0.2747; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.78)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.74)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.76)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.66)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.31)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.88)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.01)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.22)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.16)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.95)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.76)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 325
\-------------------------
Simulating trial. . .
espilon = 0.2736; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.25)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.82)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.85)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded -0.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.42)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.99)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.69)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.69)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.34)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded -0.43)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.50)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.96)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.20)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.07)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 326
\-------------------------
Simulating trial. . .
espilon = 0.2725; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.00)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.58)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.88)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.02)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.76)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.50)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.50)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.39)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.91)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.09)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.02)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.58)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 327
\-------------------------
Simulating trial. . .
espilon = 0.2714; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.77)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.95)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.15)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.71)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.17)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.27)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.24)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.78)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.65)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.73)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 1.77)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 328
\-------------------------
Simulating trial. . .
espilon = 0.2704; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'right')
Agent followed the waypoint right. (rewarded 1.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.65)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 329
\-------------------------
Simulating trial. . .
espilon = 0.2693; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.84)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.16)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.24)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.99)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.28)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.31)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.31)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.00)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.45)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.50)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 330
\-------------------------
Simulating trial. . .
espilon = 0.2682; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.33)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.20)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.49)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.14)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.09)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.07)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 331
\-------------------------
Simulating trial. . .
espilon = 0.2671; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.21)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.03)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.55)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.66)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove left instead of forward. (rewarded 1.03)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.20)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.85)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.32)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.06)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.96)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.32)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.72)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.14)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.53)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.24)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.67)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.12)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 0.65)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.17)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.33)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.86)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.19)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.93)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.91)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.98)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.98)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.76)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 332
\-------------------------
Simulating trial. . .
espilon = 0.2661; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.69)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.93)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.41)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.84)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.87)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.48)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.06)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 0.10)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.72)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.38)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.41)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.01)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.66)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.28)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.24)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 0.74)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.71)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.60)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.44)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.89)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.89)
17% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 333
\-------------------------
Simulating trial. . .
espilon = 0.2650; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 1.98)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 334
\-------------------------
Simulating trial. . .
espilon = 0.2639; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 1.35)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.61)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.00)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 335
\-------------------------
Simulating trial. . .
espilon = 0.2629; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.21)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.77)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.41)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded -0.04)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.80)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent drove forward instead of right. (rewarded 0.04)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.84)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.54)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.84)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.59)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.00)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.01)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.75)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.76)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.19)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.64)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.60)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.04)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded -0.60)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 336
\-------------------------
Simulating trial. . .
epsilon = 0.2618; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.05)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -19.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.99)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.33)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.27)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -10.75)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.70)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.74)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.63)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.93)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.85)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.06)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 337
\-------------------------
Simulating trial. . .
epsilon = 0.2608; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.89)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.48)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.52)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.42)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.79)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.32)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 338
\-------------------------
Simulating trial. . .
epsilon = 0.2598; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.44)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.17)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.51)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.42)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.28)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.31)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.75)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.52)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.69)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.90)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.54)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 1.00)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.41)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.31)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.49)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 339
\-------------------------
Simulating trial. . .
epsilon = 0.2587; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.24)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.68)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.25)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.26)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.27)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.20)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.12)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.50)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.07)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.50)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.54)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.87)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 340
\-------------------------
Simulating trial. . .
epsilon = 0.2577; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.50)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.96)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.85)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.92)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.56)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.27)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.33)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.61)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.45)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.70)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.37)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.00)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.63)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.99)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.32)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 341
\-------------------------
Simulating trial. . .
epsilon = 0.2567; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.70)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.16)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.93)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.98)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.04)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.85)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.69)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.94)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.35)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 342
\-------------------------
Simulating trial. . .
espilon = 0.2556; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.07)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.83)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.96)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.04)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.50)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.54)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.27)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.32)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.14)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.84)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.71)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 0.54)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.49)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.25)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.90)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.51)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.99)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.37)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.26)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.95)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.86)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.44)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.82)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 343
\-------------------------
Simulating trial. . .
espilon = 0.2546; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.12)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.31)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.48)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.69)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.48)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.56)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.15)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 344
\-------------------------
Simulating trial. . .
espilon = 0.2536; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.35)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.39)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 345
\-------------------------
Simulating trial. . .
espilon = 0.2526; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.93)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.43)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.57)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 346
\-------------------------
Simulating trial. . .
espilon = 0.2516; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove forward instead of right. (rewarded 1.34)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.34)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.75)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.29)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.61)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.10)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.85)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.06)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.41)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.40)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.11)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 347
\-------------------------
Simulating trial. . .
espilon = 0.2506; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'left')
Agent drove forward instead of right. (rewarded 0.19)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.98)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.91)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.47)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.26)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.94)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.66)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.43)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -10.34)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.65)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.44)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.47)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.75)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.20)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 0.75)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.06)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.40)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.41)
26% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 348
\-------------------------
Simulating trial. . .
espilon = 0.2496; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.03)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.54)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.70)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.52)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.34)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.48)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 349
\-------------------------
Simulating trial. . .
espilon = 0.2486; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.66)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.52)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.66)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.47)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded -0.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.64)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove left instead of right. (rewarded -0.36)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.02)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.07)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.27)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.70)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 350
\-------------------------
Simulating trial. . .
espilon = 0.2476; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -9.81)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.19)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.66)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.80)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.87)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.34)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.22)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.58)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.62)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.04)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.13)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.69)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.44)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -9.40)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.51)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.63)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.46)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.47)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.29)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.99)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.73)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.92)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.05)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 351
\-------------------------
Simulating trial. . .
espilon = 0.2466; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.04)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.76)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.81)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.64)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.00)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.91)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.43)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.11)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.67)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.65)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 352
\-------------------------
Simulating trial. . .
espilon = 0.2456; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.34)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.35)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.09)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.03)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.38)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.97)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.12)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.95)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 353
\-------------------------
Simulating trial. . .
espilon = 0.2446; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.07)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.19)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.59)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.21)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.73)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 354
\-------------------------
Simulating trial. . .
epsilon = 0.2437; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.25)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.32)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.09)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.91)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.27)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.58)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.16)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.83)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.10)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.42)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.33)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.47)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 355
\-------------------------
Simulating trial. . .
epsilon = 0.2427; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.09)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.35)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.71)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.76)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.92)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 1.86)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.82)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.19)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.08)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.17)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.03)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.58)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.72)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.67)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.75)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 356
\-------------------------
Simulating trial. . .
epsilon = 0.2417; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.31)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 357
\-------------------------
Simulating trial. . .
epsilon = 0.2407; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.98)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.84)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded -0.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.14)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.11)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 358
\-------------------------
Simulating trial. . .
epsilon = 0.2398; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.07)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.99)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.03)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.72)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.69)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.43)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded -0.42)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.73)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.30)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.53)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 359
\-------------------------
Simulating trial. . .
epsilon = 0.2388; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.15)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.77)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.64)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.90)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.77)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.33)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.81)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.31)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.95)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.99)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.63)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.63)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.01)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.79)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.98)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.09)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.80)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 360
\-------------------------
Simulating trial. . .
epsilon = 0.2379; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.48)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.70)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.83)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.14)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.70)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.27)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.03)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.46)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.53)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 361
\-------------------------
Simulating trial. . .
espilon = 0.2369; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 2.28)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.93)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.25)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.84)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.81)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.57)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.58)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.47)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.36)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.20)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.60)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 362
\-------------------------
Simulating trial. . .
espilon = 0.2360; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.83)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.90)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.92)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.26)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.51)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.00)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.72)
71% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 363
\-------------------------
Simulating trial. . .
espilon = 0.2350; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.56)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.71)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.48)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.14)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.76)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.02)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.23)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.03)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.64)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.57)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.19)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 0.40)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.38)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 364
\-------------------------
Simulating trial. . .
espilon = 0.2341; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.95)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.50)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.99)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.74)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.08)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.96)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 0.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.16)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.72)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.72)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.52)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.66)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 365
\-------------------------
Simulating trial. . .
espilon = 0.2332; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.08)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.23)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.26)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.76)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.47)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.04)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.83)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.94)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.32)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.14)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.27)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 366
\-------------------------
Simulating trial. . .
espilon = 0.2322; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.81)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 0.97)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.80)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.30)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove right instead of left. (rewarded -0.04)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.48)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.59)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.42)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.57)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.80)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.58)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.22)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.80)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.64)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.82)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.85)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.80)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint right. (rewarded 0.71)
23% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 367
\-------------------------
Simulating trial. . .
espilon = 0.2313; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.93)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.49)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 368
\-------------------------
Simulating trial. . .
espilon = 0.2304; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.66)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.09)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.44)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.55)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.96)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.52)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.12)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.26)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.58)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.65)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.10)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 369
\-------------------------
Simulating trial. . .
espilon = 0.2295; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.54)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.74)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 1.67)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.35)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.06)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.95)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.79)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.59)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.65)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 370
\-------------------------
Simulating trial. . .
espilon = 0.2286; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.96)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.58)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.86)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.28)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.54)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.20)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.23)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.54)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.27)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 371
\-------------------------
Simulating trial. . .
espilon = 0.2276; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 2.97)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.34)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.81)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.12)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.00)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.52)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.76)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.56)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 1.62)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 372
\-------------------------
Simulating trial. . .
espilon = 0.2267; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.88)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent drove right instead of forward. (rewarded 0.10)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.24)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.08)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.56)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.08)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.91)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.93)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 0.94)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 1.19)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.15)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded -0.70)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 373
\-------------------------
Simulating trial. . .
espilon = 0.2258; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.08)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.11)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.09)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.85)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.33)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 374
\-------------------------
Simulating trial. . .
espilon = 0.2249; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 1.12)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.55)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.16)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.10)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.48)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent drove right instead of left. (rewarded -0.11)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.64)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.41)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.84)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 0.61)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 375
\-------------------------
Simulating trial. . .
espilon = 0.2240; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.00)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.74)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.56)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.84)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.86)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 0.85)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.95)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.18)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.67)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 376
\-------------------------
Simulating trial. . .
espilon = 0.2231; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.88)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.50)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.52)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.60)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.15)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.92)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 377
\-------------------------
Simulating trial. . .
espilon = 0.2222; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.97)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.50)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.39)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.74)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.25)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.95)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.01)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.80)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.09)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.45)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.67)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 0.01)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.82)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.27)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.06)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded -0.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 378
\-------------------------
Simulating trial. . .
espilon = 0.2214; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.05)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.91)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.54)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.77)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.20)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.28)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.68)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.06)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 379
\-------------------------
Simulating trial. . .
espilon = 0.2205; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.83)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.55)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.32)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 380
\-------------------------
Simulating trial. . .
epsilon = 0.2196; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.51)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.87)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.00)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.24)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 381
\-------------------------
Simulating trial. . .
epsilon = 0.2187; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.51)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.77)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.61)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.07)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.60)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 382
\-------------------------
Simulating trial. . .
epsilon = 0.2178; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.22)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 1.08)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent drove forward instead of left. (rewarded 0.18)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 0.84)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.89)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.43)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.88)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.34)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.56)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.40)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 383
\-------------------------
Simulating trial. . .
epsilon = 0.2170; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.10)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.78)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.23)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.19)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.00)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.52)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent drove left instead of forward. (rewarded 1.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.15)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 0.96)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.16)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -19.43)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.12)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.48)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.25)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded 1.33)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.27)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.22)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.05)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.14)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.84)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.78)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.14)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.06)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 384
\-------------------------
Simulating trial. . .
epsilon = 0.2161; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.95)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.14)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.52)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.41)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.51)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.80)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.93)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.19)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.22)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.01)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.82)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.49)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.05)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.17)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.40)
54% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 385
\-------------------------
Simulating trial. . .
epsilon = 0.2152; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.23)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.78)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.62)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.54)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 1.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.15)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.57)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.38)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.76)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.48)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.35)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.06)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.90)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 386
\-------------------------
Simulating trial. . .
espilon = 0.2144; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.52)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 1.71)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.59)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.55)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.64)
72% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 387
\-------------------------
Simulating trial. . .
espilon = 0.2135; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.77)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.84)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.99)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.44)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.63)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'forward')
Agent followed the waypoint right. (rewarded 2.36)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.39)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -9.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.90)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.13)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.05)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.74)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.04)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.25)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.66)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.80)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'right')
Agent drove right instead of left. (rewarded 0.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 388
\-------------------------
Simulating trial. . .
espilon = 0.2127; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 0.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 0.07)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.99)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.40)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.91)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.55)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.66)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.14)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.98)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 389
\-------------------------
Simulating trial. . .
espilon = 0.2118; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.66)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.82)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.88)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.42)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.61)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.46)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.35)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.41)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.96)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 390
\-------------------------
Simulating trial. . .
espilon = 0.2110; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.86)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -10.62)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.47)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.92)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.07)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.18)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 391
\-------------------------
Simulating trial. . .
espilon = 0.2101; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.95)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.21)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.92)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.31)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent drove forward instead of left. (rewarded 0.27)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.20)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 392
\-------------------------
Simulating trial. . .
espilon = 0.2093; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.47)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.53)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.39)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.95)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.30)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving forward through a red light. (rewarded -10.61)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.61)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.52)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.67)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.64)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.06)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.18)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.03)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.39)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.82)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.58)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.87)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.85)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.62)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.94)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.69)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.76)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.05)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded -0.00)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.88)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.30)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 393
\-------------------------
Simulating trial. . .
espilon = 0.2085; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.76)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.63)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.74)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.86)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.68)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.01)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.11)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.64)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.15)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.59)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.65)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.56)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove right instead of left. (rewarded 0.77)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.50)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.59)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 394
\-------------------------
Simulating trial. . .
espilon = 0.2076; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent drove forward instead of right. (rewarded 1.62)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.33)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.09)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.69)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.21)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.30)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 395
\-------------------------
Simulating trial. . .
espilon = 0.2068; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.46)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.30)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.60)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.27)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.39)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 396
\-------------------------
Simulating trial. . .
espilon = 0.2060; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.72)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.90)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.12)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 397
\-------------------------
Simulating trial. . .
espilon = 0.2052; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.11)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.42)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.44)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent drove left instead of forward. (rewarded 0.30)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.64)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.13)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.77)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.12)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.21)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.71)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.66)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.27)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 0.26)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.04)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.03)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 0.74)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.67)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.65)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.57)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 398
\-------------------------
Simulating trial. . .
espilon = 0.2043; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.39)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.65)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.26)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.48)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.03)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.60)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'right')
Agent drove forward instead of left. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.48)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.42)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.58)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.94)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.56)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.56)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 399
\-------------------------
Simulating trial. . .
espilon = 0.2035; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.58)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.48)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.22)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 0.99)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 400
\-------------------------
Simulating trial. . .
espilon = 0.2027; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.75)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.53)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.85)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.65)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.68)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.81)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.04)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 0.84)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.88)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of right. (rewarded -0.14)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 1.52)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.37)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.22)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.51)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.80)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.10)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.46)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 0.64)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 401
\-------------------------
Simulating trial. . .
espilon = 0.2019; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.32)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.38)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.21)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.64)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.44)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.15)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.15)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.37)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.05)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.59)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 402
\-------------------------
Simulating trial. . .
espilon = 0.2011; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.59)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.71)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.50)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.82)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.34)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.47)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.14)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded -0.48)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.50)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.01)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 403
\-------------------------
Simulating trial. . .
espilon = 0.2003; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.60)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.62)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.10)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.73)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.60)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.24)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.86)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded -0.18)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.73)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.04)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'forward')
Agent drove forward instead of left. (rewarded -0.04)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.30)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.16)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.19)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 404
\-------------------------
Simulating trial. . .
espilon = 0.1995; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent drove forward instead of left. (rewarded 1.25)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.70)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.45)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.74)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.85)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.85)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.58)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.23)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.54)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.71)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.54)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded -0.18)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.13)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.45)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.62)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.44)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.80)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.99)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.72)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.93)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.67)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.60)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.18)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.45)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 405
\-------------------------
Simulating trial. . .
espilon = 0.1987; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.35)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.50)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.98)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -9.10)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.29)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.45)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.57)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.72)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.18)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 406
\-------------------------
Simulating trial. . .
espilon = 0.1979; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.05)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.93)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.80)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.07)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.66)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.98)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.79)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.96)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 407
\-------------------------
Simulating trial. . .
espilon = 0.1971; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.79)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.19)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.58)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.62)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.62)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.65)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.94)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 408
\-------------------------
Simulating trial. . .
espilon = 0.1963; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 2.92)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.42)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.96)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.18)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.28)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.42)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.44)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.54)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.59)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 409
\-------------------------
Simulating trial. . .
espilon = 0.1955; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.57)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.84)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.58)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.35)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.77)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove right instead of left. (rewarded 1.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 0.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.03)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.46)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.92)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.65)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.16)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.03)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.14)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.26)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.51)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.47)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.98)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.64)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.00)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 410
\-------------------------
Simulating trial. . .
espilon = 0.1948; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.80)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.97)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.53)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.11)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.23)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.20)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 411
\-------------------------
Simulating trial. . .
espilon = 0.1940; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.93)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.34)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.00)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.14)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.05)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 412
\-------------------------
Simulating trial. . .
epsilon = 0.1932; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.00)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.66)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.59)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.41)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.40)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.80)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.85)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.07)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.25)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.35)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.42)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.12)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.35)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.00)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.08)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.50)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.59)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.10)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.15)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.19)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.52)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 413
\-------------------------
Simulating trial. . .
epsilon = 0.1924; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.29)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove forward instead of right. (rewarded 0.66)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.48)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.59)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 414
\-------------------------
Simulating trial. . .
epsilon = 0.1917; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.73)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.30)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.01)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.92)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.27)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.39)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.16)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.15)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.84)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.34)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.12)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.52)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.19)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.81)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.58)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.70)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.53)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.68)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.26)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'right')
Agent drove forward instead of left. (rewarded -0.24)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.64)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.71)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.37)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 415
\-------------------------
Simulating trial. . .
epsilon = 0.1909; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.53)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.18)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.80)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.07)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.88)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.60)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.49)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.03)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.11)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 416
\-------------------------
Simulating trial. . .
epsilon = 0.1901; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.89)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.42)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.90)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.31)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.56)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 417
\-------------------------
Simulating trial. . .
epsilon = 0.1894; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.52)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 418
\-------------------------
Simulating trial. . .
espilon = 0.1886; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.19)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.29)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.99)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.59)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.87)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.88)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.16)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -9.50)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent drove right instead of forward. (rewarded 0.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.19)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 0.64)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.72)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.85)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.21)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.50)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.95)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 419
\-------------------------
Simulating trial. . .
espilon = 0.1879; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.33)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.36)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.35)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.03)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.32)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.10)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 1.58)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.18)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.98)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.67)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.93)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 420
\-------------------------
Simulating trial. . .
espilon = 0.1871; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.15)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.71)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 1.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.58)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.40)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.31)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.51)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.45)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', None)
Agent drove forward instead of right. (rewarded 0.64)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.66)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.73)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.52)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.47)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.76)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 421
\-------------------------
Simulating trial. . .
espilon = 0.1864; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.77)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.75)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.34)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove right instead of forward. (rewarded 0.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.54)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.09)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.72)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove right instead of left. (rewarded 0.53)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.23)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.76)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.72)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.38)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.00)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.58)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.29)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.24)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.80)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 422
\-------------------------
Simulating trial. . .
espilon = 0.1856; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.33)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.42)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.57)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.25)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.70)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.04)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.07)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.13)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.05)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.65)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.12)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove forward instead of left. (rewarded 0.84)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.30)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.70)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.40)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 423
\-------------------------
Simulating trial. . .
espilon = 0.1849; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove left instead of forward. (rewarded 0.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.81)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 424
\-------------------------
Simulating trial. . .
epsilon = 0.1842; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent attempted driving left through a red light. (rewarded -10.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.19)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.71)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.17)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.05)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.58)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.50)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.18)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.45)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 425
\-------------------------
Simulating trial. . .
epsilon = 0.1834; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.44)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.02)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.29)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.39)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.58)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.79)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 426
\-------------------------
Simulating trial. . .
epsilon = 0.1827; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.24)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.97)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.80)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.68)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.69)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.52)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.09)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.37)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.26)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.68)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.90)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 0.99)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.18)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.94)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.72)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 427
\-------------------------
Simulating trial. . .
epsilon = 0.1820; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.96)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.39)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.12)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.80)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 428
\-------------------------
Simulating trial. . .
epsilon = 0.1812; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.05)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.14)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.19)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.18)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove left instead of right. (rewarded 0.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.01)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 1.33)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.06)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.55)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.96)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 429
\-------------------------
Simulating trial. . .
epsilon = 0.1805; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.33)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.74)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 0.23)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.27)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.93)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.84)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.27)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.99)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.72)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.47)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.97)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.58)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 0.67)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 430
\-------------------------
Simulating trial. . .
epsilon = 0.1798; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.35)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.69)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 431
\-------------------------
Simulating trial. . .
epsilon = 0.1791; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.37)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.51)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.08)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.25)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.63)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -40.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.77)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.55)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 432
\-------------------------
Simulating trial. . .
epsilon = 0.1784; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.76)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.82)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.99)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.66)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.43)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 1.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.03)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.99)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.78)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.53)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 2.22)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.80)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.79)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.67)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 433
\-------------------------
Simulating trial. . .
epsilon = 0.1776; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.68)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.57)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.74)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.31)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 434
\-------------------------
Simulating trial. . .
epsilon = 0.1769; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove left instead of right. (rewarded 1.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.88)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.61)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 0.97)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.18)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.95)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.59)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.90)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.83)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.03)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded -0.16)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.18)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.43)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.04)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.55)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.88)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 435
\-------------------------
Simulating trial. . .
epsilon = 0.1762; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.28)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.27)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.51)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.37)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.90)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.89)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.68)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 436
\-------------------------
Simulating trial. . .
epsilon = 0.1755; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.90)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.83)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.70)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.81)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.81)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.03)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.40)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.15)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.22)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.30)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.48)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 437
\-------------------------
Simulating trial. . .
epsilon = 0.1748; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.75)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.68)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.49)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.73)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.15)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.20)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.85)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.82)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 0.46)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 438
\-------------------------
Simulating trial. . .
espilon = 0.1741; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.71)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.25)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.60)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.53)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.72)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 1.44)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 1.70)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.23)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.33)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.11)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.03)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.23)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.78)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.62)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 439
\-------------------------
Simulating trial. . .
espilon = 0.1734; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.60)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.69)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.23)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.62)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.72)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.53)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.34)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 440
\-------------------------
Simulating trial. . .
espilon = 0.1727; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.70)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.37)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.44)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.41)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded -0.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 1.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.04)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.81)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.35)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded -0.02)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.51)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.68)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.56)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.49)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.01)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.98)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.30)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.02)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 441
\-------------------------
Simulating trial. . .
espilon = 0.1720; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent drove right instead of left. (rewarded 1.92)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.65)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.13)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.71)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.12)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.23)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.02)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.00)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.85)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.17)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 0.83)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.03)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 442
\-------------------------
Simulating trial. . .
espilon = 0.1714; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.02)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.10)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.67)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.22)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 443
\-------------------------
Simulating trial. . .
espilon = 0.1707; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.66)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.54)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.99)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.34)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.75)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.78)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.34)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.09)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.28)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.65)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.70)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.76)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.56)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.17)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.07)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 444
\-------------------------
Simulating trial. . .
espilon = 0.1700; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.96)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.19)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.57)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.78)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 445
\-------------------------
Simulating trial. . .
espilon = 0.1693; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.09)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.39)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.24)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.22)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent drove forward instead of left. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.02)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.12)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.32)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.60)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.25)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent drove left instead of forward. (rewarded 0.32)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.10)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 0.75)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 446
\-------------------------
Simulating trial. . .
espilon = 0.1686; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove left instead of forward. (rewarded 1.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -41.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.13)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.24)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.84)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.34)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.79)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 447
\-------------------------
Simulating trial. . .
espilon = 0.1680; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.57)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.27)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.92)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.91)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.85)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.50)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.66)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 2.79)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.76)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.73)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.21)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.34)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.20)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.59)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.66)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 448
\-------------------------
Simulating trial. . .
espilon = 0.1673; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.92)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent drove right instead of left. (rewarded 1.14)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 1.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.50)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.70)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.33)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.16)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.78)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.80)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.60)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.40)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.62)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 449
\-------------------------
Simulating trial. . .
espilon = 0.1666; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'forward')
Agent drove forward instead of left. (rewarded 1.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.15)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.75)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.14)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.03)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.31)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.88)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 450
\-------------------------
Simulating trial. . .
espilon = 0.1660; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.62)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.47)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.21)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.94)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.86)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 0.56)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.71)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.15)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.16)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.36)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.18)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.81)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.79)
33% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 451
\-------------------------
Simulating trial. . .
espilon = 0.1653; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.48)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.80)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.35)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.31)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 0.56)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.59)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 0.94)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.79)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.64)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 452
\-------------------------
Simulating trial. . .
espilon = 0.1646; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.85)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.89)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.15)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.81)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.50)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.81)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.63)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.02)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 0.96)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.58)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 453
\-------------------------
Simulating trial. . .
espilon = 0.1640; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.55)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.02)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.03)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.54)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.32)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.76)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded -0.02)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.47)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.93)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.18)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.43)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.12)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.18)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.43)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.08)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 0.72)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.41)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.68)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.15)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.19)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 454
\-------------------------
Simulating trial. . .
espilon = 0.1633; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.89)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.17)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.27)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 455
\-------------------------
Simulating trial. . .
espilon = 0.1627; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.79)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 456
\-------------------------
Simulating trial. . .
espilon = 0.1620; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.24)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.86)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.61)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'forward')
Agent drove forward instead of right. (rewarded 1.10)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.61)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.62)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 457
\-------------------------
Simulating trial. . .
espilon = 0.1614; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.17)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.90)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.85)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.00)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.31)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.67)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 458
\-------------------------
Simulating trial. . .
espilon = 0.1607; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.41)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.47)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.14)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.44)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.23)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.15)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent followed the waypoint forward. (rewarded 1.70)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.90)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.27)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.85)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.52)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.14)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 459
\-------------------------
Simulating trial. . .
espilon = 0.1601; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent drove left instead of right. (rewarded 0.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.40)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.79)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.07)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.25)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.16)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.39)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.46)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.76)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.34)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.97)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.69)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 460
\-------------------------
Simulating trial. . .
espilon = 0.1595; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.19)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.48)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove right instead of left. (rewarded 0.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.75)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.05)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.40)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.56)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.30)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 461
\-------------------------
Simulating trial. . .
espilon = 0.1588; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.93)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.30)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.74)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.87)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -9.21)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.64)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.89)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 462
\-------------------------
Simulating trial. . .
espilon = 0.1582; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.63)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.84)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.21)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.88)
72% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 463
\-------------------------
Simulating trial. . .
espilon = 0.1576; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.67)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 0.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.67)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.61)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.50)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 464
\-------------------------
Simulating trial. . .
espilon = 0.1569; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.18)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.06)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.37)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.98)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.12)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent followed the waypoint forward. (rewarded 2.72)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.86)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.56)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 0.87)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.61)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 465
\-------------------------
Simulating trial. . .
espilon = 0.1563; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 0.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.22)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.75)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.62)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 466
\-------------------------
Simulating trial. . .
espilon = 0.1557; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.33)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.12)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 1.68)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.06)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.66)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.29)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 467
\-------------------------
Simulating trial. . .
espilon = 0.1551; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.83)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.12)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.30)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.95)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.00)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 468
\-------------------------
Simulating trial. . .
espilon = 0.1544; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'left')
Agent followed the waypoint left. (rewarded 1.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.09)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.61)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 469
\-------------------------
Simulating trial. . .
espilon = 0.1538; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.89)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.48)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 470
\-------------------------
Simulating trial. . .
espilon = 0.1532; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.52)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.86)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.59)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.41)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.32)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.55)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.06)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.45)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.48)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.65)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent properly idled at a red light. (rewarded 1.29)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.84)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.42)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.72)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.63)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.25)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded -0.06)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.03)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.78)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.40)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 0.34)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.97)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.90)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 471
\-------------------------
Simulating trial. . .
espilon = 0.1526; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.68)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.31)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.20)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.09)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.63)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.65)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 472
\-------------------------
Simulating trial. . .
espilon = 0.1520; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.54)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.81)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.67)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 473
\-------------------------
Simulating trial. . .
espilon = 0.1514; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.17)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.87)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.55)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.43)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.75)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 474
\-------------------------
Simulating trial. . .
espilon = 0.1508; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.31)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.12)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.15)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.55)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.73)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 0.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.98)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 0.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.10)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.21)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.75)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.55)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.58)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.69)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.71)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 475
\-------------------------
Simulating trial. . .
espilon = 0.1502; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.96)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.15)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.95)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove right instead of left. (rewarded 0.43)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.69)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.91)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.73)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.02)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.45)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.17)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.67)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.19)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.74)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 476
\-------------------------
Simulating trial. . .
espilon = 0.1496; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.22)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.61)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.96)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.94)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 477
\-------------------------
Simulating trial. . .
espilon = 0.1490; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.36)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 478
\-------------------------
Simulating trial. . .
espilon = 0.1484; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.98)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.74)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.47)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.36)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.95)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent followed the waypoint right. (rewarded 1.77)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.14)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 479
\-------------------------
Simulating trial. . .
espilon = 0.1478; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.62)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.83)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.56)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.18)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.70)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.18)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.61)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.68)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.50)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.90)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 480
\-------------------------
Simulating trial. . .
espilon = 0.1472; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.20)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.06)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.95)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.17)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.23)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 481
\-------------------------
Simulating trial. . .
espilon = 0.1466; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.65)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.53)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 482
\-------------------------
Simulating trial. . .
espilon = 0.1460; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.11)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.20)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.39)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.74)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.61)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.61)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.45)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent followed the waypoint right. (rewarded 0.95)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.25)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 483
\-------------------------
Simulating trial. . .
epsilon = 0.1454; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.76)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.09)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.80)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.86)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'right')
Agent drove right instead of forward. (rewarded 0.23)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.63)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.22)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.21)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.24)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.17)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.30)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.70)
4% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 484
\-------------------------
Simulating trial. . .
epsilon = 0.1449; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.52)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.08)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.58)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.73)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.70)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 485
\-------------------------
Simulating trial. . .
epsilon = 0.1443; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.74)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.77)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.63)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.89)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.41)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.73)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.28)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.69)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.83)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 486
\-------------------------
Simulating trial. . .
epsilon = 0.1437; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.13)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.41)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.87)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.71)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.85)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.67)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.50)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.47)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.67)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.79)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.57)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.13)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 0.77)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.57)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 487
\-------------------------
Simulating trial. . .
epsilon = 0.1431; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.34)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.59)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.30)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.94)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 488
\-------------------------
Simulating trial. . .
epsilon = 0.1426; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.39)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.93)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.11)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.85)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.71)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.33)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.41)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.38)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.45)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.46)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.14)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.02)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 489
\-------------------------
Simulating trial. . .
epsilon = 0.1420; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.41)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.17)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.32)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.86)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.42)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.43)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.38)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 490
\-------------------------
Simulating trial. . .
epsilon = 0.1414; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.87)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.26)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.78)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.14)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.60)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.47)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.35)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 491
\-------------------------
Simulating trial. . .
espilon = 0.1409; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 2.00)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.26)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.40)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.52)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.63)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.34)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.62)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.96)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.99)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 492
\-------------------------
Simulating trial. . .
espilon = 0.1403; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.03)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.29)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.21)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.99)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.27)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.60)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 493
\-------------------------
Simulating trial. . .
espilon = 0.1397; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove forward instead of right. (rewarded 0.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.17)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.37)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.07)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.11)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.43)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.66)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.12)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 494
\-------------------------
Simulating trial. . .
espilon = 0.1392; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.22)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.23)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.21)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.26)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.83)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.59)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.57)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent drove right instead of forward. (rewarded 1.13)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.81)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.22)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.64)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.52)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded -0.30)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.69)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.47)
16% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 495
\-------------------------
Simulating trial. . .
espilon = 0.1386; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.84)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.99)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.19)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 496
\-------------------------
Simulating trial. . .
espilon = 0.1381; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.45)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.56)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.54)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 497
\-------------------------
Simulating trial. . .
espilon = 0.1375; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.83)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.00)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.22)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.14)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.09)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.61)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.37)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.31)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.66)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.04)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 498
\-------------------------
Simulating trial. . .
espilon = 0.1370; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.77)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.38)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.21)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.91)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 499
\-------------------------
Simulating trial. . .
espilon = 0.1364; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.66)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.93)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.16)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.93)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.87)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.82)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.34)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 500
\-------------------------
Simulating trial. . .
espilon = 0.1359; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.41)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.45)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.67)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.25)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.87)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.20)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.48)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.26)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.70)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.08)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.97)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 501
\-------------------------
Simulating trial. . .
espilon = 0.1353; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.60)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.37)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.71)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.00)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.31)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.09)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.04)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.79)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.20)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent drove right instead of left. (rewarded 0.11)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.89)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 0.99)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.10)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 0.96)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.15)
8% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 502
\-------------------------
Simulating trial. . .
espilon = 0.1348; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.44)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.61)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.31)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.86)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.45)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.27)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.90)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.71)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.03)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.57)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.57)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.19)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.65)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.52)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.18)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.44)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -9.95)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 503
\-------------------------
Simulating trial. . .
espilon = 0.1343; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.90)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.26)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.41)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.30)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.95)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.74)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.56)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.47)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 504
\-------------------------
Simulating trial. . .
espilon = 0.1337; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'forward')
Agent followed the waypoint right. (rewarded 2.55)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.13)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.71)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.33)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded -0.05)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.87)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.12)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.40)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.99)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.98)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.74)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 505
\-------------------------
Simulating trial. . .
espilon = 0.1332; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 1.57)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.28)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.31)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.04)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.51)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.64)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.95)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.70)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.24)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.36)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.20)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.50)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.37)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 506
\-------------------------
Simulating trial. . .
espilon = 0.1327; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.76)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.26)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.56)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.92)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.60)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.66)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.03)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.12)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 507
\-------------------------
Simulating trial. . .
espilon = 0.1321; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.13)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.64)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 508
\-------------------------
Simulating trial. . .
espilon = 0.1316; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.71)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.17)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.89)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.25)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.97)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.62)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.11)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 509
\-------------------------
Simulating trial. . .
espilon = 0.1311; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.58)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.59)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 2.20)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.74)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 510
\-------------------------
Simulating trial. . .
espilon = 0.1305; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.30)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.26)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.48)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.78)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.82)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 511
\-------------------------
Simulating trial. . .
espilon = 0.1300; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.88)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 2.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 0.06)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.73)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.36)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.35)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 512
\-------------------------
Simulating trial. . .
espilon = 0.1295; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.55)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.65)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.30)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.60)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 513
\-------------------------
Simulating trial. . .
espilon = 0.1290; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.50)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.36)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.90)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 0.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.38)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.71)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.10)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.58)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.13)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.39)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 0.55)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.70)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 514
\-------------------------
Simulating trial. . .
epsilon = 0.1285; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.48)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.57)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.66)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.01)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.51)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.29)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.84)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.89)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 515
\-------------------------
Simulating trial. . .
epsilon = 0.1280; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.89)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 516
\-------------------------
Simulating trial. . .
epsilon = 0.1275; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.31)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.01)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.05)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent drove right instead of forward. (rewarded 1.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.73)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 517
\-------------------------
Simulating trial. . .
epsilon = 0.1269; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.01)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.89)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.62)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.89)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.70)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 0.99)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 518
\-------------------------
Simulating trial. . .
epsilon = 0.1264; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.78)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.60)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.84)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.58)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.56)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.35)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 519
\-------------------------
Simulating trial. . .
epsilon = 0.1259; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.26)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.65)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.98)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.98)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.63)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.15)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 520
\-------------------------
Simulating trial. . .
epsilon = 0.1254; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 1.06)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.51)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.85)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.95)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 521
\-------------------------
Simulating trial. . .
epsilon = 0.1249; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.85)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'left')
Agent drove right instead of forward. (rewarded 0.60)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.05)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.85)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.21)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.18)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.63)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.41)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.84)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.11)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.13)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.12)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.51)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.49)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.91)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.09)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 522
\-------------------------
Simulating trial. . .
epsilon = 0.1244; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.57)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.94)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.87)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.79)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.84)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.08)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.16)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.11)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.24)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.83)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.84)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.78)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.46)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.91)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.35)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 2.24)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.24)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.24)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 1.28)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.34)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent drove forward instead of left. (rewarded 1.11)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.26)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.85)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.75)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.57)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 523
\-------------------------
Simulating trial. . .
epsilon = 0.1239; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.83)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.92)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 0.99)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.40)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.30)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.51)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.64)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded -0.15)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.06)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.39)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 524
\-------------------------
Simulating trial. . .
epsilon = 0.1234; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.17)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.01)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 525
\-------------------------
Simulating trial. . .
epsilon = 0.1229; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.44)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.19)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.10)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.50)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.06)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.94)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.76)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.59)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.06)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.06)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.02)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.82)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 0.39)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.61)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 526
\-------------------------
Simulating trial. . .
epsilon = 0.1225; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.32)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.63)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.13)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.02)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.92)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.70)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.45)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent drove right instead of left. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.18)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.73)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.70)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.93)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.09)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.86)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.31)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.07)
33% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 527
\-------------------------
Simulating trial. . .
espilon = 0.1220; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.97)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.73)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.78)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.02)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 528
\-------------------------
Simulating trial. . .
espilon = 0.1215; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.10)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.56)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.23)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -10.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.84)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.42)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.66)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 529
\-------------------------
Simulating trial. . .
espilon = 0.1210; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.87)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.02)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.59)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.37)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.88)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.27)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.13)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.47)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.18)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.37)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.18)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.75)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.26)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.93)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.49)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.48)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.93)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.64)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 530
\-------------------------
Simulating trial. . .
espilon = 0.1205; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.21)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.19)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 531
\-------------------------
Simulating trial. . .
espilon = 0.1200; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.68)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.44)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.68)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.80)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 532
\-------------------------
Simulating trial. . .
espilon = 0.1196; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.10)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.34)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.14)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.80)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 533
\-------------------------
Simulating trial. . .
espilon = 0.1191; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.14)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.93)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.32)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.66)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.20)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.96)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 534
\-------------------------
Simulating trial. . .
espilon = 0.1186; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.07)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.68)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.84)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -10.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.42)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.02)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.76)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 535
\-------------------------
Simulating trial. . .
espilon = 0.1181; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.78)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.55)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.27)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.94)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.90)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.42)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.90)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.75)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.96)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 0.97)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.27)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.50)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.53)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.15)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 536
\-------------------------
Simulating trial. . .
espilon = 0.1177; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.67)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.22)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 537
\-------------------------
Simulating trial. . .
espilon = 0.1172; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.65)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.57)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.31)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.05)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.07)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.61)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.10)
51% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 538
\-------------------------
Simulating trial. . .
espilon = 0.1167; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 1.55)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.11)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.92)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.88)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.83)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.78)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.55)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.82)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.42)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.88)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 539
\-------------------------
Simulating trial. . .
espilon = 0.1163; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.54)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.82)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.17)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.62)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.13)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.42)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.04)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.76)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.15)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.78)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.92)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove forward instead of right. (rewarded 0.74)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent drove left instead of right. (rewarded 1.05)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.02)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.43)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.30)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.04)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.64)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.09)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.31)
13% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 540
\-------------------------
Simulating trial. . .
espilon = 0.1158; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 1.46)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent drove forward instead of left. (rewarded 0.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.29)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.53)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.55)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.47)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.08)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.07)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.92)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.96)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.74)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 541
\-------------------------
Simulating trial. . .
espilon = 0.1153; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.26)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.88)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.02)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 542
\-------------------------
Simulating trial. . .
espilon = 0.1149; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 1.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.96)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.89)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.64)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.00)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.77)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 543
\-------------------------
Simulating trial. . .
espilon = 0.1144; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.96)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.54)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.90)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.75)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.32)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.30)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.70)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.57)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 544
\-------------------------
Simulating trial. . .
espilon = 0.1139; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 1.19)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.29)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.96)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.16)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.83)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.86)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.64)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.91)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.96)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.99)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.89)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.38)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 545
\-------------------------
Simulating trial. . .
espilon = 0.1135; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.54)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.43)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.63)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.46)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.23)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.04)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.01)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 546
\-------------------------
Simulating trial. . .
espilon = 0.1130; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.17)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.09)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.87)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.82)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.03)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.34)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.97)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 547
\-------------------------
Simulating trial. . .
espilon = 0.1126; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.21)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.29)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.00)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.24)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.70)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.97)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.92)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.32)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 548
\-------------------------
Simulating trial. . .
espilon = 0.1121; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.11)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.99)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.05)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.86)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.53)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent drove forward instead of right. (rewarded 0.37)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.10)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.93)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.59)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.32)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.16)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.48)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.66)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.14)
33% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 549
\-------------------------
Simulating trial. . .
espilon = 0.1117; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.04)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.95)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.89)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.20)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.25)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.98)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.56)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.48)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.72)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.74)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.34)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 0.63)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.50)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 0.50)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.27)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 550
\-------------------------
Simulating trial. . .
espilon = 0.1112; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.65)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.39)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.78)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.03)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.57)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.57)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.37)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 551
\-------------------------
Simulating trial. . .
espilon = 0.1108; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.53)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 2.55)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.58)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.48)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.82)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.28)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.44)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.97)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.81)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.42)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.96)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.96)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.62)
8% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 552
\-------------------------
Simulating trial. . .
espilon = 0.1104; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 0.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded -0.08)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.31)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.84)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 0.45)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.18)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.49)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 553
\-------------------------
Simulating trial. . .
espilon = 0.1099; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.77)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.79)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 554
\-------------------------
Simulating trial. . .
espilon = 0.1095; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.73)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.84)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.82)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.84)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.89)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.66)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.02)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.40)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -4.82)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.37)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 1.51)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 555
\-------------------------
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
Simulating trial. . .
espilon = 0.1090; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.80)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.97)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.89)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'right')
Agent drove forward instead of right. (rewarded 1.05)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.00)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.66)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.10)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.01)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.41)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.27)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 556
\-------------------------
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
Simulating trial. . .
espilon = 0.1086; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.34)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 1.21)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent attempted driving left through a red light. (rewarded -10.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.78)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.94)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.34)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.40)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.30)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.24)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.84)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.97)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 557
\-------------------------
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
Simulating trial. . .
espilon = 0.1082; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.90)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.96)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.03)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.03)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.07)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.55)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.32)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 558
\-------------------------
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
Simulating trial. . .
espilon = 0.1077; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.51)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.94)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.18)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.84)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 559
\-------------------------
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
Simulating trial. . .
espilon = 0.1073; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.17)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.46)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.88)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.78)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.92)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove left instead of forward. (rewarded 1.90)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.47)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.60)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.01)
71% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 560
\-------------------------
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
Simulating trial. . .
espilon = 0.1069; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.06)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.37)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 561
\-------------------------
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
Simulating trial. . .
espilon = 0.1065; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.53)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.59)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 562
\-------------------------
Simulating trial. . .
espilon = 0.1060; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'right')
Agent properly idled at a red light. (rewarded 2.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.94)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.45)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.80)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.00)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.68)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 563
\-------------------------
Simulating trial. . .
epsilon = 0.1056; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 1.90)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.93)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.96)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.69)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.15)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.44)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.50)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.53)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 564
\-------------------------
Simulating trial. . .
epsilon = 0.1052; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.08)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.85)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.17)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.07)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.41)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.95)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.83)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.42)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.06)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 565
\-------------------------
Simulating trial. . .
epsilon = 0.1048; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.00)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.77)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.84)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.80)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.86)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.31)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 566
\-------------------------
Simulating trial. . .
epsilon = 0.1044; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 1.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 0.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.67)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.42)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.31)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.63)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.36)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving right through traffic and caused a minor accident. (rewarded -20.96)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.00)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.12)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 567
\-------------------------
Simulating trial. . .
epsilon = 0.1039; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.08)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.60)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.56)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.22)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 1.88)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 0.93)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.07)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.69)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.64)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent followed the waypoint left. (rewarded 1.90)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.03)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.26)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 568
\-------------------------
Simulating trial. . .
epsilon = 0.1035; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.06)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.75)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.10)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.88)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.79)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 569
\-------------------------
Simulating trial. . .
epsilon = 0.1031; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving forward through a red light. (rewarded -9.08)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.79)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.19)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.93)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 570
\-------------------------
Simulating trial. . .
epsilon = 0.1027; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.96)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.47)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.77)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.40)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.89)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.60)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent followed the waypoint right. (rewarded 1.28)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.78)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.69)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.61)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.27)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.80)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.57)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 571
\-------------------------
Simulating trial. . .
epsilon = 0.1023; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.07)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.77)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 572
\-------------------------
Simulating trial. . .
epsilon = 0.1019; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.34)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.29)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.40)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -10.00)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.54)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.06)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent drove right instead of forward. (rewarded 0.88)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.58)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.39)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.59)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded -0.07)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 573
\-------------------------
Simulating trial. . .
epsilon = 0.1015; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.78)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.75)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.17)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.25)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.20)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.20)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove forward instead of right. (rewarded 1.19)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.28)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.09)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.54)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.14)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.69)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.93)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 574
\-------------------------
Simulating trial. . .
epsilon = 0.1011; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'right')
Agent followed the waypoint right. (rewarded 1.52)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.23)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.73)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 1.31)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.99)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.93)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.71)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.98)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.23)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.38)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.17)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.03)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.23)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.87)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.52)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 575
\-------------------------
Simulating trial. . .
epsilon = 0.1007; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.32)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.14)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.47)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.58)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.97)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.98)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 576
\-------------------------
Simulating trial. . .
epsilon = 0.1003; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.57)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.78)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.27)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.78)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.79)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.62)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.80)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.39)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.76)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.86)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 577
\-------------------------
Simulating trial. . .
epsilon = 0.0999; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.87)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.79)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.78)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 578
\-------------------------
Simulating trial. . .
espilon = 0.0995; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.38)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.71)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.11)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.98)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 579
\-------------------------
Simulating trial. . .
espilon = 0.0991; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.04)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.06)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.67)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 580
\-------------------------
Simulating trial. . .
espilon = 0.0987; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.86)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.95)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.58)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'left')
Agent drove right instead of left. (rewarded 0.97)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.30)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.63)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 0.90)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.29)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 581
\-------------------------
Simulating trial. . .
espilon = 0.0983; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.97)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.69)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.61)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.64)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.87)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.24)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.44)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded -0.11)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.74)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.31)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 582
\-------------------------
Simulating trial. . .
espilon = 0.0979; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.41)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.62)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.33)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.32)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.60)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.34)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.18)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.12)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.81)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.23)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving left through a red light. (rewarded -9.59)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.55)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.12)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.13)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.62)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.39)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.96)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.17)
23% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 583
\-------------------------
Simulating trial. . .
espilon = 0.0975; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.06)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.39)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.35)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.61)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.81)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.68)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.94)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.18)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 584
\-------------------------
Simulating trial. . .
espilon = 0.0971; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 0.85)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent drove forward instead of right. (rewarded 1.97)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.27)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.24)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.19)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 585
\-------------------------
Simulating trial. . .
espilon = 0.0967; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.66)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.99)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.66)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.57)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.40)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.57)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.31)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.16)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.69)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'left')
Agent drove right instead of left. (rewarded 1.38)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 0.82)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.36)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.33)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.47)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.48)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 586
\-------------------------
Simulating trial. . .
espilon = 0.0963; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.40)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.75)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.61)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.66)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.94)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.81)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent attempted driving left through a red light. (rewarded -9.13)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.36)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.39)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.63)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.18)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.12)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 587
\-------------------------
Simulating trial. . .
espilon = 0.0959; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.54)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent drove right instead of forward. (rewarded 0.27)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'right')
Agent drove right instead of left. (rewarded 0.82)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.85)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.79)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.65)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 588
\-------------------------
Simulating trial. . .
espilon = 0.0956; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.03)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.29)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.65)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.32)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.92)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.47)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.38)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 0.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.09)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.44)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.65)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -9.43)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.50)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 589
\-------------------------
Simulating trial. . .
espilon = 0.0952; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'right')
Agent drove forward instead of left. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.18)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.24)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.79)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.94)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.07)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.55)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.89)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.77)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.26)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.77)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 590
\-------------------------
Simulating trial. . .
espilon = 0.0948; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.17)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.82)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.89)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.67)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.05)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 1.01)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.90)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.67)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.59)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded -0.19)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.84)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 0.92)
16% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 591
\-------------------------
Simulating trial. . .
espilon = 0.0944; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 0.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.50)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.32)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.91)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.31)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 592
\-------------------------
Simulating trial. . .
espilon = 0.0940; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.19)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.02)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.29)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.49)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.21)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.65)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 0.86)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.23)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.35)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.62)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.15)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.49)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.39)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 0.93)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.19)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.09)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.58)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.91)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.56)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.20)
17% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 593
\-------------------------
Simulating trial. . .
espilon = 0.0937; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.59)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.44)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.24)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.43)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.76)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.84)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.61)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.97)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.88)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.88)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.15)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.74)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.84)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 594
\-------------------------
Simulating trial. . .
espilon = 0.0933; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.34)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.79)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.26)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.67)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.53)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.57)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 595
\-------------------------
Simulating trial. . .
espilon = 0.0929; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.95)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.03)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.34)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.12)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.05)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.71)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.98)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.93)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.63)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.97)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.17)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 1.82)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 596
\-------------------------
Simulating trial. . .
espilon = 0.0926; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.35)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.97)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.31)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.97)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.99)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.03)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.12)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.58)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.78)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.84)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.96)
46% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 597
\-------------------------
Simulating trial. . .
espilon = 0.0922; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.72)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.67)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.41)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.82)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.44)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.61)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 598
\-------------------------
Simulating trial. . .
espilon = 0.0918; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.97)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.26)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.78)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.00)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.78)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.97)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.10)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.48)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.80)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.16)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 599
\-------------------------
Simulating trial. . .
espilon = 0.0914; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.30)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.53)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.75)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 600
\-------------------------
Simulating trial. . .
espilon = 0.0911; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.29)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.63)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.75)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 601
\-------------------------
Simulating trial. . .
espilon = 0.0907; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.11)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.74)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.54)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.62)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 1.12)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.76)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.60)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.52)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.66)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 602
\-------------------------
Simulating trial. . .
espilon = 0.0904; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.31)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove right instead of forward. (rewarded 1.11)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.12)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.26)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.76)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.43)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 603
\-------------------------
Simulating trial. . .
espilon = 0.0900; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.44)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.33)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.22)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.13)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.27)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 1.08)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.43)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.23)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 1.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.55)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.12)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.24)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.90)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.22)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.14)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 604
\-------------------------
Simulating trial. . .
epsilon = 0.0896; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.64)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.14)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.24)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.86)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 605
\-------------------------
Simulating trial. . .
epsilon = 0.0893; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 1.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.37)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.91)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.23)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.71)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 0.64)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.62)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 606
\-------------------------
Simulating trial. . .
epsilon = 0.0889; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 0.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.32)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.64)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.62)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -39.17)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.29)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.87)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.26)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 607
\-------------------------
Simulating trial. . .
epsilon = 0.0886; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.99)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.51)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.48)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.10)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.59)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.04)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.39)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 608
\-------------------------
Simulating trial. . .
epsilon = 0.0882; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.56)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent drove forward instead of left. (rewarded 1.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.52)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.21)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.19)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.19)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.66)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.42)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.29)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.13)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.20)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.61)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.30)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 609
\-------------------------
Simulating trial. . .
epsilon = 0.0879; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.54)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.95)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.15)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.00)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.88)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.95)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.66)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.23)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.91)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.51)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.50)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 610
\-------------------------
Simulating trial. . .
epsilon = 0.0875; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.96)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.12)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.64)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.56)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded -0.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.05)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.90)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.02)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.42)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.17)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.01)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.99)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.50)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.38)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.25)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.11)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 611
\-------------------------
Simulating trial. . .
espilon = 0.0872; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.78)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 0.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.07)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 612
\-------------------------
Simulating trial. . .
espilon = 0.0868; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -20.33)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.46)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.76)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.04)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent drove right instead of forward. (rewarded 1.35)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.97)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.99)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 0.95)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.70)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.81)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.22)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.47)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.59)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 613
\-------------------------
Simulating trial. . .
espilon = 0.0865; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.81)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.66)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent drove right instead of forward. (rewarded 0.67)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.95)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.66)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.65)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.81)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.34)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.91)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.48)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.02)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.67)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.88)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'right')
Agent followed the waypoint right. (rewarded 0.66)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 614
\-------------------------
Simulating trial. . .
espilon = 0.0861; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.29)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.69)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.22)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.02)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.50)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.94)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.62)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.70)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.25)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.96)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.52)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 615
\-------------------------
Simulating trial. . .
espilon = 0.0858; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 0.63)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.70)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.53)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.78)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.74)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.60)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.68)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.58)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.30)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 2.79)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.82)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 1.34)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.59)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.22)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.76)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 616
\-------------------------
Simulating trial. . .
espilon = 0.0854; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.69)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 0.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.91)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.98)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'left')
Agent followed the waypoint forward. (rewarded 0.93)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.62)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded -0.46)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.52)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.96)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 617
\-------------------------
Simulating trial. . .
espilon = 0.0851; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.22)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'forward')
Agent followed the waypoint right. (rewarded 2.95)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.04)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.71)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.24)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.57)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.93)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.38)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.71)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.36)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.20)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.46)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.84)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.08)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.94)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.93)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 618
\-------------------------
Simulating trial. . .
espilon = 0.0848; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.36)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.43)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.18)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.18)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.06)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent drove forward instead of right. (rewarded 0.01)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.95)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.26)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 619
\-------------------------
Simulating trial. . .
espilon = 0.0844; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.50)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.75)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.27)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.64)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.60)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.14)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.22)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.82)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 620
\-------------------------
Simulating trial. . .
espilon = 0.0841; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.88)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.75)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.27)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.43)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 621
\-------------------------
Simulating trial. . .
espilon = 0.0837; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove forward instead of right. (rewarded 0.81)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.25)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.52)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.39)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.08)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded -0.04)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.40)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.35)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.04)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.59)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.90)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 622
\-------------------------
Simulating trial. . .
espilon = 0.0834; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.40)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.98)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.46)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.23)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.30)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.09)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.55)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.61)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.61)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.55)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.78)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.50)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.57)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.98)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.23)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.25)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.65)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 623
\-------------------------
Simulating trial. . .
espilon = 0.0831; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.54)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.50)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.58)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.39)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.05)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.17)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.96)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.43)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.84)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.06)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.57)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.31)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 624
\-------------------------
Simulating trial. . .
espilon = 0.0827; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.48)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 2.22)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.22)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.51)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.51)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.68)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.30)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.88)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.48)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 625
\-------------------------
Simulating trial. . .
espilon = 0.0824; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.50)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.22)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.65)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.43)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded -0.05)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.43)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.57)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.01)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.32)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 626
\-------------------------
Simulating trial. . .
espilon = 0.0821; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.83)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.05)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.59)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.69)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.21)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 0.96)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 627
\-------------------------
Simulating trial. . .
espilon = 0.0818; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.79)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.68)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.19)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.72)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 628
\-------------------------
Simulating trial. . .
espilon = 0.0814; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.54)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.35)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.89)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.15)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.11)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.18)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 629
\-------------------------
Simulating trial. . .
espilon = 0.0811; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.51)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.14)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.67)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.07)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.42)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.69)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.21)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 630
\-------------------------
Simulating trial. . .
espilon = 0.0808; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.81)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.27)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.80)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.70)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.72)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.32)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.12)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.36)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.82)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.63)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 631
\-------------------------
Simulating trial. . .
espilon = 0.0805; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.62)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.53)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.58)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.94)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.19)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.11)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.62)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.69)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.36)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 632
\-------------------------
Simulating trial. . .
espilon = 0.0801; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.31)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent followed the waypoint right. (rewarded 2.06)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.37)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.34)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.58)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.30)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.90)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 633
\-------------------------
Simulating trial. . .
espilon = 0.0798; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.51)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.30)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.99)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.00)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 0.76)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.87)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 0.71)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.31)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 634
\-------------------------
Simulating trial. . .
espilon = 0.0795; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.60)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.82)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.02)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.18)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 635
\-------------------------
Simulating trial. . .
espilon = 0.0792; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.78)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 636
\-------------------------
Simulating trial. . .
espilon = 0.0789; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.66)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.36)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.85)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.54)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.32)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.81)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.52)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove forward instead of left. (rewarded 0.21)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.67)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.35)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded -0.01)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.26)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.47)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent drove right instead of left. (rewarded 0.34)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.52)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.97)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.07)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.49)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.58)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 637
\-------------------------
Simulating trial. . .
espilon = 0.0786; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.93)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.07)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.27)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.88)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.72)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.48)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.84)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.16)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.27)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 638
\-------------------------
Simulating trial. . .
espilon = 0.0782; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded -0.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.59)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded -0.24)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.71)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.44)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 639
\-------------------------
Simulating trial. . .
epsilon = 0.0779; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.57)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.29)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.89)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 640
\-------------------------
Simulating trial. . .
epsilon = 0.0776; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.12)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.69)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.97)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.51)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 641
\-------------------------
Simulating trial. . .
epsilon = 0.0773; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent attempted driving right through traffic and caused a minor accident. (rewarded -20.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.80)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.65)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 642
\-------------------------
Simulating trial. . .
epsilon = 0.0770; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.93)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.03)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.92)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.65)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.88)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 643
\-------------------------
Simulating trial. . .
epsilon = 0.0767; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent drove right instead of forward. (rewarded 0.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 1.08)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.44)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.63)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.83)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 2.41)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.47)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.94)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.06)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.57)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 644
\-------------------------
Simulating trial. . .
epsilon = 0.0764; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 0.02)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.20)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.75)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 1.01)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.61)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.52)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.04)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.52)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.88)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.84)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.99)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.21)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.10)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.27)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove forward instead of right. (rewarded 0.80)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.11)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.39)
13% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 645
\-------------------------
Simulating trial. . .
epsilon = 0.0761; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.68)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.40)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 2.87)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.80)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.95)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.62)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.41)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.01)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.35)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.25)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 646
\-------------------------
Simulating trial. . .
espilon = 0.0758; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 2.10)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.20)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.94)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.93)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.83)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.97)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.30)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.81)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 647
\-------------------------
Simulating trial. . .
espilon = 0.0755; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.57)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.83)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.86)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.72)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.76)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.59)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 2.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.53)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 648
\-------------------------
Simulating trial. . .
espilon = 0.0752; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.97)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.73)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.08)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of right. (rewarded -0.04)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.09)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.49)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.92)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.39)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 649
\-------------------------
Simulating trial. . .
espilon = 0.0749; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.95)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.37)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.33)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 650
\-------------------------
Simulating trial. . .
espilon = 0.0746; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.78)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.42)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.92)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.32)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.21)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving forward through a red light. (rewarded -9.56)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.95)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.84)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.15)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.59)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.96)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 651
\-------------------------
Simulating trial. . .
espilon = 0.0743; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.72)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.92)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.59)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.53)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.10)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.41)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.22)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.33)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.91)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.62)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.98)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.98)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.86)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'right')
Agent followed the waypoint forward. (rewarded 1.65)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 652
\-------------------------
Simulating trial. . .
espilon = 0.0740; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.78)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 0.53)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.97)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.97)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.74)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.48)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.18)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.74)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.08)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.88)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.45)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 653
\-------------------------
Simulating trial. . .
espilon = 0.0737; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent drove forward instead of right. (rewarded 0.84)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.40)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.62)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.35)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.37)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.95)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.82)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.81)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.52)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.68)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 654
\-------------------------
Simulating trial. . .
espilon = 0.0734; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.80)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.97)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.23)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.36)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.44)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 655
\-------------------------
Simulating trial. . .
espilon = 0.0731; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.47)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 0.36)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.42)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.30)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.72)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.72)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 656
\-------------------------
Simulating trial. . .
espilon = 0.0728; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.82)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.55)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.89)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.52)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.14)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.85)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.21)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.27)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent followed the waypoint left. (rewarded 2.83)
66% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 657
\-------------------------
Simulating trial. . .
espilon = 0.0725; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'right')
Agent followed the waypoint right. (rewarded 1.06)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.22)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.51)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.84)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.10)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.36)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.55)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.66)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.22)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.71)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.05)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.20)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.15)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.66)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.93)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.28)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.34)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.55)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 658
\-------------------------
Simulating trial. . .
espilon = 0.0722; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.29)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.55)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 659
\-------------------------
Simulating trial. . .
espilon = 0.0719; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.17)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.94)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.58)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.22)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.92)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.03)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.20)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.98)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.32)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.74)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.68)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.72)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.11)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 660
\-------------------------
Simulating trial. . .
espilon = 0.0716; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.08)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.83)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.09)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.74)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.06)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.39)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.97)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 0.04)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.29)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.02)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.38)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.12)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.35)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.33)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.60)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.97)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.40)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.67)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.41)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 661
\-------------------------
Simulating trial. . .
espilon = 0.0714; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.67)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.60)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.50)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove right instead of left. (rewarded 0.45)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.13)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.86)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.81)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 662
\-------------------------
Simulating trial. . .
espilon = 0.0711; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.79)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.50)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.45)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.58)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 1.32)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.91)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.70)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.85)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.45)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.32)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.34)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.20)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.77)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.96)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.60)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 663
\-------------------------
Simulating trial. . .
espilon = 0.0708; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.02)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.13)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.82)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.80)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.06)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 0.86)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.72)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.49)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.10)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 664
\-------------------------
Simulating trial. . .
espilon = 0.0705; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.00)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.63)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.07)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.62)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 665
\-------------------------
Simulating trial. . .
espilon = 0.0702; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.48)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.56)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove left instead of right. (rewarded 0.56)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.71)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.02)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 666
\-------------------------
Simulating trial. . .
espilon = 0.0699; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.20)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.33)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.23)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.55)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.10)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.06)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.07)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.03)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.84)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.14)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.64)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.94)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.65)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.73)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.99)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 667
\-------------------------
Simulating trial. . .
espilon = 0.0697; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.33)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.19)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 668
\-------------------------
Simulating trial. . .
espilon = 0.0694; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.79)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.33)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 2.76)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.17)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded -0.15)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.08)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.07)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.01)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 669
\-------------------------
Simulating trial. . .
espilon = 0.0691; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.39)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.86)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.21)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.89)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 1.64)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.60)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.56)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.57)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.22)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.38)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent attempted driving left through a red light. (rewarded -9.27)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.26)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.73)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.65)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 670
\-------------------------
Simulating trial. . .
espilon = 0.0688; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.12)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.83)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.34)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.11)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.42)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 671
\-------------------------
Simulating trial. . .
espilon = 0.0686; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent drove left instead of forward. (rewarded 0.25)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.08)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.10)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'forward')
Agent drove forward instead of right. (rewarded 0.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 2.75)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.87)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.31)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.34)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.36)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.48)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.77)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 672
\-------------------------
Simulating trial. . .
espilon = 0.0683; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.16)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.18)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.13)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.82)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.09)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.89)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.24)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 673
\-------------------------
Simulating trial. . .
espilon = 0.0680; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.35)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.02)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.14)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.72)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.12)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.11)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', None)
Agent drove left instead of forward. (rewarded 0.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.79)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.49)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.88)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.21)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.21)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.38)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 674
\-------------------------
Simulating trial. . .
espilon = 0.0677; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.81)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.86)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.19)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.03)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.71)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 0.63)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.15)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.09)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.24)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 675
\-------------------------
Simulating trial. . .
espilon = 0.0675; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.77)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.22)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.40)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.90)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 1.73)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.46)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.82)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.32)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.20)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.64)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.42)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.89)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.38)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 676
\-------------------------
Simulating trial. . .
espilon = 0.0672; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.12)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.13)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.14)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.23)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 677
\-------------------------
Simulating trial. . .
espilon = 0.0669; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.89)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.39)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.82)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.85)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -10.43)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent drove forward instead of left. (rewarded 0.92)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.16)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 678
\-------------------------
Simulating trial. . .
espilon = 0.0667; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.74)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.24)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.90)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.82)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 1.36)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 679
\-------------------------
Simulating trial. . .
espilon = 0.0664; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.69)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.21)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.91)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.33)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.52)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.31)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.99)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.74)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 0.82)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 680
\-------------------------
Simulating trial. . .
espilon = 0.0661; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.64)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.85)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.10)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.23)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.38)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.65)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.99)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.66)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 681
\-------------------------
Simulating trial. . .
espilon = 0.0659; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent drove right instead of forward. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.71)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.75)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.64)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.88)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.82)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 682
\-------------------------
Simulating trial. . .
espilon = 0.0656; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 1.53)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.70)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.75)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.09)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.07)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.95)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.19)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.22)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.92)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.21)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.09)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -9.40)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.73)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.93)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.51)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.36)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 683
\-------------------------
Simulating trial. . .
espilon = 0.0653; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.74)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.64)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.62)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.54)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.11)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded -0.45)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.81)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 684
\-------------------------
Simulating trial. . .
epsilon = 0.0651; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.32)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.72)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent drove right instead of left. (rewarded 0.84)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.69)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.97)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.11)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.13)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.46)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.35)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.41)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.08)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.04)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.60)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.41)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.45)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 685
\-------------------------
Simulating trial. . .
epsilon = 0.0648; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.51)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.31)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 686
\-------------------------
Simulating trial. . .
epsilon = 0.0646; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.14)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.98)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.11)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 687
\-------------------------
Simulating trial. . .
epsilon = 0.0643; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.69)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.56)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.76)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.14)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.90)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.96)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.27)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.53)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.31)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.98)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.80)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.18)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.81)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 688
\-------------------------
Simulating trial. . .
epsilon = 0.0641; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.82)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.64)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.36)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.67)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.08)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.20)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.81)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 689
\-------------------------
Simulating trial. . .
epsilon = 0.0638; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 1.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.23)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent attempted driving forward through a red light with traffic and caused a major accident. (rewarded -40.90)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.72)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.08)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 1.08)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 0.72)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 690
\-------------------------
Simulating trial. . .
epsilon = 0.0635; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.42)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.32)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.97)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 0.67)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.73)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.83)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.45)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.02)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 691
\-------------------------
Simulating trial. . .
espilon = 0.0633; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.10)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.45)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.05)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.02)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.20)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 692
\-------------------------
Simulating trial. . .
espilon = 0.0630; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.31)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.00)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.71)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 693
\-------------------------
Simulating trial. . .
espilon = 0.0628; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.02)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.63)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.80)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 694
\-------------------------
Simulating trial. . .
espilon = 0.0625; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 2.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.31)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.19)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 695
\-------------------------
Simulating trial. . .
espilon = 0.0623; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.95)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 1.67)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.27)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.19)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.53)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.54)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 696
\-------------------------
Simulating trial. . .
espilon = 0.0620; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.25)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.74)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.42)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.79)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 697
\-------------------------
Simulating trial. . .
espilon = 0.0618; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.75)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.42)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.57)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.39)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.40)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.03)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.52)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.47)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.25)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.61)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.91)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.50)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 698
\-------------------------
Simulating trial. . .
espilon = 0.0615; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.01)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.59)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.78)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.67)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.21)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.85)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.72)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 699
\-------------------------
Simulating trial. . .
espilon = 0.0613; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.64)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.04)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.83)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.90)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.69)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.29)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.44)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 700
\-------------------------
Simulating trial. . .
espilon = 0.0611; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.80)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.13)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.51)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.44)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.53)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.63)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.04)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.23)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.63)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.59)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.03)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.72)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 701
\-------------------------
Simulating trial. . .
espilon = 0.0608; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.15)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -9.36)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.14)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.29)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.47)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.42)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 702
\-------------------------
Simulating trial. . .
espilon = 0.0606; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.89)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.11)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.99)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.88)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.86)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.80)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.83)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.53)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.39)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.81)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.31)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.63)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 703
\-------------------------
Simulating trial. . .
espilon = 0.0603; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.41)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 704
\-------------------------
Simulating trial. . .
espilon = 0.0601; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.02)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.24)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.93)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 705
\-------------------------
Simulating trial. . .
espilon = 0.0598; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.97)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.23)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 1.74)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.30)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.71)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.52)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.05)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.50)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.22)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded -0.03)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.39)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.44)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 706
\-------------------------
Simulating trial. . .
espilon = 0.0596; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.68)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.81)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove left instead of forward. (rewarded 0.70)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.08)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.56)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.03)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 707
\-------------------------
Simulating trial. . .
espilon = 0.0594; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.44)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.91)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.18)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 708
\-------------------------
Simulating trial. . .
espilon = 0.0591; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.31)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.96)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.28)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.53)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.91)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.59)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.63)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.95)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 1.05)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.28)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.23)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.20)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.44)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.63)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 709
\-------------------------
Simulating trial. . .
espilon = 0.0589; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.42)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.98)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.41)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.03)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 710
\-------------------------
Simulating trial. . .
espilon = 0.0587; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -9.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.52)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.40)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.31)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.76)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 711
\-------------------------
Simulating trial. . .
espilon = 0.0584; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.66)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 1.15)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.50)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.16)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.22)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.28)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.38)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.93)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.14)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.40)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 712
\-------------------------
Simulating trial. . .
espilon = 0.0582; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.36)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'right')
Agent drove right instead of left. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.74)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.24)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.28)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.64)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.90)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.93)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded -0.26)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.81)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.40)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.29)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 713
\-------------------------
Simulating trial. . .
espilon = 0.0580; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.42)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.30)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 714
\-------------------------
Simulating trial. . .
espilon = 0.0577; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.86)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.71)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.58)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.23)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.16)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.38)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.80)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.97)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.69)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.02)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.61)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.86)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'right')
Agent followed the waypoint right. (rewarded 2.21)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.86)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.23)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 715
\-------------------------
Simulating trial. . .
espilon = 0.0575; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.90)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.33)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.66)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.09)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.72)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.39)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.21)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 716
\-------------------------
Simulating trial. . .
epsilon = 0.0573; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 1.61)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.41)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.01)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.53)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.50)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.04)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 717
\-------------------------
Simulating trial. . .
epsilon = 0.0570; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.47)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.62)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.87)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 718
\-------------------------
Simulating trial. . .
epsilon = 0.0568; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.21)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.33)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.25)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.06)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.69)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 719
\-------------------------
Simulating trial. . .
epsilon = 0.0566; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.86)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.67)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.83)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.73)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.46)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 720
\-------------------------
Simulating trial. . .
epsilon = 0.0564; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.45)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent drove left instead of right. (rewarded 1.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.58)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.33)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.04)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.65)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 721
\-------------------------
Simulating trial. . .
epsilon = 0.0561; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.34)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.63)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.61)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.26)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 722
\-------------------------
Simulating trial. . .
epsilon = 0.0559; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.72)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove left instead of forward. (rewarded 0.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent drove left instead of forward. (rewarded 0.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.10)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.11)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 723
\-------------------------
Simulating trial. . .
epsilon = 0.0557; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.54)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.22)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.80)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.34)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.75)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 724
\-------------------------
Simulating trial. . .
epsilon = 0.0555; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.07)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.56)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.63)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.38)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.89)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 1.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.22)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.03)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.19)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.96)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.73)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.11)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.97)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 1.03)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.18)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.37)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 725
\-------------------------
Simulating trial. . .
espilon = 0.0552; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.82)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.18)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.12)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.74)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.69)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.15)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent drove right instead of left. (rewarded 1.68)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.83)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.29)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.49)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.29)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.83)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.92)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.41)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.66)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.79)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.69)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 0.64)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.46)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.19)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 726
\-------------------------
Simulating trial. . .
espilon = 0.0550; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.91)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.23)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.89)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.73)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.89)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.96)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 727
\-------------------------
Simulating trial. . .
espilon = 0.0548; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.58)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.49)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 728
\-------------------------
Simulating trial. . .
espilon = 0.0546; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.00)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.68)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded -0.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 0.86)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.54)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.65)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.00)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 729
\-------------------------
Simulating trial. . .
espilon = 0.0544; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.88)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.77)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.75)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent drove right instead of forward. (rewarded -0.00)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.80)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.21)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.52)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.69)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.16)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded -0.15)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.56)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.30)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 730
\-------------------------
Simulating trial. . .
espilon = 0.0541; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.18)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.62)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.02)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.36)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.55)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.70)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.60)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 731
\-------------------------
Simulating trial. . .
espilon = 0.0539; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.98)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.82)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.11)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.66)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.42)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 2.51)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.60)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.27)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent drove left instead of forward. (rewarded 0.58)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.04)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.18)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.39)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 732
\-------------------------
Simulating trial. . .
espilon = 0.0537; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.21)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.72)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 733
\-------------------------
Simulating trial. . .
espilon = 0.0535; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.61)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.38)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.64)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.15)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.57)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 734
\-------------------------
Simulating trial. . .
espilon = 0.0533; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.67)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.06)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent attempted driving left through a red light. (rewarded -10.73)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.86)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.27)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.56)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 735
\-------------------------
Simulating trial. . .
espilon = 0.0531; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.68)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.89)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.76)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.41)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.55)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.67)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.05)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.28)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.91)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 736
\-------------------------
Simulating trial. . .
espilon = 0.0529; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.92)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.26)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.97)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.18)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.56)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 737
\-------------------------
Simulating trial. . .
espilon = 0.0527; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.89)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.56)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.34)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.85)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 738
\-------------------------
Simulating trial. . .
espilon = 0.0524; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.51)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.00)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.17)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.00)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.84)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.94)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 739
\-------------------------
Simulating trial. . .
espilon = 0.0522; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 0.52)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.92)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.86)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.29)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.55)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.41)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.16)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.63)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.35)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -10.13)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.91)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded -0.37)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.44)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.27)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.62)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.78)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.44)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.50)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.18)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.12)
3% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 740
\-------------------------
Simulating trial. . .
espilon = 0.0520; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.25)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.38)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.83)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.57)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 0.74)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 741
\-------------------------
Simulating trial. . .
espilon = 0.0518; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.37)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.57)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.21)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.92)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.21)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.68)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 742
\-------------------------
Simulating trial. . .
espilon = 0.0516; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.19)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 1.19)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.60)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.28)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.22)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 743
\-------------------------
Simulating trial. . .
espilon = 0.0514; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.24)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.52)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.27)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.52)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.00)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.81)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.05)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.57)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.81)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded -0.76)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 744
\-------------------------
Simulating trial. . .
espilon = 0.0512; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.87)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.44)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.20)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 745
\-------------------------
Simulating trial. . .
espilon = 0.0510; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.76)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent drove forward instead of right. (rewarded 1.54)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.53)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.58)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent idled at a green light with no oncoming traffic. (rewarded -5.04)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.91)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.52)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.01)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.53)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.69)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.89)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 746
\-------------------------
Simulating trial. . .
espilon = 0.0508; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.02)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.11)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.82)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.65)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.98)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent drove right instead of forward. (rewarded 1.16)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.28)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.54)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.57)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.21)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.94)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 747
\-------------------------
Simulating trial. . .
espilon = 0.0506; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.71)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 748
\-------------------------
Simulating trial. . .
espilon = 0.0504; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 0.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.89)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.45)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.29)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.32)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.70)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.10)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.53)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.98)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 749
\-------------------------
Simulating trial. . .
espilon = 0.0502; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.38)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.17)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.36)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.79)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.55)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 0.83)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.81)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 750
\-------------------------
Simulating trial. . .
espilon = 0.0500; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.26)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.33)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.19)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent attempted driving forward through a red light. (rewarded -9.03)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent drove forward instead of left. (rewarded 0.78)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.40)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.41)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent drove right instead of forward. (rewarded -0.01)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.14)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.03)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.87)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.98)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.89)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.92)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 0.24)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent attempted driving left through a red light. (rewarded -9.44)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.53)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 0.96)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.21)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.57)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.41)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'right')
Agent followed the waypoint right. (rewarded 1.72)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.65)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.16)
3% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 751
\-------------------------
Simulating trial. . .
espilon = 0.0498; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.94)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.67)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.57)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.50)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.05)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.33)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.92)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.15)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.20)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.46)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.96)
66% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 752
\-------------------------
Simulating trial. . .
espilon = 0.0496; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.50)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.29)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.37)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.11)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.90)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.87)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.66)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.32)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.93)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.08)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 753
\-------------------------
Simulating trial. . .
espilon = 0.0494; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.83)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.09)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.91)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.96)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.45)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.34)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.86)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.40)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.28)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.29)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 754
\-------------------------
Simulating trial. . .
espilon = 0.0492; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.07)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded -0.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.44)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.86)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.38)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.36)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 755
\-------------------------
Simulating trial. . .
espilon = 0.0490; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.27)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.18)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.02)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.68)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.57)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.74)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.72)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.51)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.32)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded -0.24)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.07)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded -0.19)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.31)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.38)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.55)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 0.36)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.32)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.48)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.99)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.66)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 756
\-------------------------
Simulating trial. . .
espilon = 0.0488; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.74)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.53)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.41)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.01)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.69)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.14)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.89)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.91)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.89)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.75)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.56)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.13)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.00)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.53)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 757
\-------------------------
Simulating trial. . .
espilon = 0.0486; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.02)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.10)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.84)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.80)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.81)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.20)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.58)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.95)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.69)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.52)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.81)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.36)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.84)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.46)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 758
\-------------------------
Simulating trial. . .
espilon = 0.0484; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.87)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.18)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.13)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.65)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.97)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.44)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.07)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.10)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 0.99)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.91)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.65)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.42)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.70)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.99)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.92)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 759
\-------------------------
Simulating trial. . .
espilon = 0.0482; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.76)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent drove right instead of forward. (rewarded 0.16)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.38)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.69)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.07)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 760
\-------------------------
Simulating trial. . .
espilon = 0.0480; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent followed the waypoint left. (rewarded 1.37)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.10)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.69)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.31)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.12)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.72)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.56)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.38)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.09)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.59)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 761
\-------------------------
Simulating trial. . .
espilon = 0.0478; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.46)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.42)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.18)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.20)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.83)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.74)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.43)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.44)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.49)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.33)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.47)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.34)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.65)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.34)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent attempted driving right through traffic and cause a minor accident. (rewarded -19.75)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.91)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.22)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.08)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.98)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -20.72)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.39)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.14)
7% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 762
\-------------------------
Simulating trial. . .
espilon = 0.0476; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.73)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.64)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 763
\-------------------------
Simulating trial. . .
espilon = 0.0475; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.19)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.86)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 764
\-------------------------
Simulating trial. . .
espilon = 0.0473; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.03)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.92)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.95)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.57)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 1.04)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 765
\-------------------------
Simulating trial. . .
espilon = 0.0471; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'left')
Agent followed the waypoint right. (rewarded 1.36)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.54)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.61)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.75)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.38)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.68)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.28)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 766
\-------------------------
Simulating trial. . .
espilon = 0.0469; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.75)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent drove right instead of left. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.51)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.82)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.52)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.68)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.08)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.26)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.89)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.54)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.96)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.55)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 767
\-------------------------
Simulating trial. . .
espilon = 0.0467; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 1.79)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.73)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.69)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.24)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.72)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.30)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.08)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.47)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.32)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.21)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.76)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 1.22)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 0.10)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.37)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.39)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.17)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.48)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 768
\-------------------------
Simulating trial. . .
espilon = 0.0465; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.94)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.39)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent attempted driving forward through a red light. (rewarded -10.01)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.28)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 1.18)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 2.65)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.33)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.61)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.76)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.13)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 769
\-------------------------
Simulating trial. . .
espilon = 0.0463; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.94)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.92)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.75)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.25)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.73)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.49)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.62)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 770
\-------------------------
Simulating trial. . .
espilon = 0.0461; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.91)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.77)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.08)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.75)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.45)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.84)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.90)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.11)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded -0.22)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.85)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.17)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.28)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.74)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.27)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.71)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 771
\-------------------------
Simulating trial. . .
espilon = 0.0460; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 1.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.74)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.47)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.09)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.92)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent drove right instead of left. (rewarded 1.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.40)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.75)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.70)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.01)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 1.13)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.00)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent drove left instead of right. (rewarded -0.11)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.84)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.43)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.66)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.59)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.09)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 772
\-------------------------
Simulating trial. . .
espilon = 0.0458; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.66)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.06)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.29)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 773
\-------------------------
Simulating trial. . .
espilon = 0.0456; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.52)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.70)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.87)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 774
\-------------------------
Simulating trial. . .
espilon = 0.0454; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.14)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.06)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.10)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.96)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.50)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 775
\-------------------------
Simulating trial. . .
espilon = 0.0452; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.00)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.02)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.73)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.59)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.32)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.99)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.07)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.41)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.00)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 776
\-------------------------
Simulating trial. . .
espilon = 0.0450; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.12)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 777
\-------------------------
Simulating trial. . .
espilon = 0.0449; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.63)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.77)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.17)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.90)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.44)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.73)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.69)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.33)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.20)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.72)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.52)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.48)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.00)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.08)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.94)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.08)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.15)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 778
\-------------------------
Simulating trial. . .
espilon = 0.0447; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.85)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.03)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.78)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.41)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.91)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 1.28)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.23)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 779
\-------------------------
Simulating trial. . .
espilon = 0.0445; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.69)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.51)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.39)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.76)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.68)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.72)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.06)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.15)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.39)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.72)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.07)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 780
\-------------------------
Simulating trial. . .
espilon = 0.0443; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.80)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.80)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.99)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.09)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.19)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.45)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 0.60)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.85)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.28)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.67)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.15)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 781
\-------------------------
Simulating trial. . .
espilon = 0.0442; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.00)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.48)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.50)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.40)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.84)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.74)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 782
\-------------------------
Simulating trial. . .
espilon = 0.0440; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.07)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.31)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 783
\-------------------------
Simulating trial. . .
espilon = 0.0438; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'right')
Agent drove right instead of left. (rewarded 1.87)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.64)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.75)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.54)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.02)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.30)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.27)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 784
\-------------------------
Simulating trial. . .
espilon = 0.0436; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.40)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.56)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.30)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.15)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.08)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 785
\-------------------------
Simulating trial. . .
espilon = 0.0435; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.81)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.10)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.19)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.46)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.68)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.43)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.32)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.80)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.13)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 0.98)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded -0.54)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.82)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 786
\-------------------------
Simulating trial. . .
espilon = 0.0433; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.98)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.65)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.84)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.85)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.05)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.45)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'right')
Agent drove right instead of left. (rewarded 1.33)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.32)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.84)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 0.88)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'right')
Agent drove right instead of forward. (rewarded -0.03)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.11)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 787
\-------------------------
Simulating trial. . .
espilon = 0.0431; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.27)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.74)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.06)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.26)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -4.96)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.85)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.00)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.26)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.92)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 0.62)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.27)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 788
\-------------------------
Simulating trial. . .
espilon = 0.0429; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 1.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -9.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.92)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.29)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.96)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.07)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.43)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.02)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.53)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.13)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 789
\-------------------------
Simulating trial. . .
espilon = 0.0428; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.97)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.38)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.80)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.94)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 790
\-------------------------
Simulating trial. . .
espilon = 0.0426; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.77)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.69)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.84)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.45)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.77)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 791
\-------------------------
Simulating trial. . .
espilon = 0.0424; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.50)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent drove right instead of left. (rewarded 1.15)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.28)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.06)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.40)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 2.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.53)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.36)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent attempted driving left through a red light. (rewarded -10.01)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.00)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.10)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.68)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.81)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 792
\-------------------------
Simulating trial. . .
espilon = 0.0423; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.87)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove forward instead of right. (rewarded 1.49)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.31)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.36)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.93)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.38)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 793
\-------------------------
Simulating trial. . .
espilon = 0.0421; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.55)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.54)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 0.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.60)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.56)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.56)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.20)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 0.85)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.41)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.63)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', None)
Agent drove left instead of right. (rewarded 1.20)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.43)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.40)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.09)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.74)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.33)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded -0.35)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', 'right')
Agent drove forward instead of right. (rewarded -0.07)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.86)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 794
\-------------------------
Simulating trial. . .
espilon = 0.0419; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.51)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.95)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.26)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.68)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 795
\-------------------------
Simulating trial. . .
espilon = 0.0418; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.09)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.50)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.66)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 796
\-------------------------
Simulating trial. . .
espilon = 0.0416; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.97)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.39)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.48)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.37)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.51)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.50)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.13)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.40)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.73)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.45)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.13)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 797
\-------------------------
Simulating trial. . .
epsilon = 0.0414; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 2.31)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.31)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.12)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.54)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.44)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 798
\-------------------------
Simulating trial. . .
epsilon = 0.0413; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.03)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.14)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.77)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.03)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.89)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.15)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.35)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.21)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 799
\-------------------------
Simulating trial. . .
epsilon = 0.0411; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.22)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.83)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.64)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent drove right instead of forward. (rewarded 1.28)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.32)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.84)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.77)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.38)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.59)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 800
\-------------------------
Simulating trial. . .
epsilon = 0.0409; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.69)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.67)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.80)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.09)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.52)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.84)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.19)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.81)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.75)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 0.81)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.24)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 801
\-------------------------
Simulating trial. . .
epsilon = 0.0408; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.48)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.06)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.81)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.33)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 2.21)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.28)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 802
\-------------------------
Simulating trial. . .
epsilon = 0.0406; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.92)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.30)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.09)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.06)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.39)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent attempted driving left through a red light with traffic and caused a major accident. (rewarded -39.20)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.50)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.95)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.55)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.53)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.49)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.67)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.73)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 803
\-------------------------
Simulating trial. . .
epsilon = 0.0404; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.29)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.06)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.85)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.13)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.69)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.95)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.24)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 804
\-------------------------
Simulating trial. . .
epsilon = 0.0403; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.24)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.63)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.19)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 805
\-------------------------
Simulating trial. . .
epsilon = 0.0401; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.41)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.32)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 0.96)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 806
\-------------------------
Simulating trial. . .
espilon = 0.0400; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.62)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.73)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.83)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.95)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.81)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.90)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.20)
73% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 807
\-------------------------
Simulating trial. . .
espilon = 0.0398; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.61)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.45)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.47)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 808
\-------------------------
Simulating trial. . .
espilon = 0.0396; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 2.59)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent drove left instead of right. (rewarded 1.73)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.07)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.35)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 809
\-------------------------
Simulating trial. . .
espilon = 0.0395; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove forward instead of right. (rewarded 1.89)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.14)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.16)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 810
\-------------------------
Simulating trial. . .
espilon = 0.0393; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.57)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.90)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.67)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.96)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 811
\-------------------------
Simulating trial. . .
espilon = 0.0392; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.54)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 812
\-------------------------
Simulating trial. . .
espilon = 0.0390; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 1.63)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.63)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.22)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.54)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.64)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.54)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.42)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 813
\-------------------------
Simulating trial. . .
espilon = 0.0389; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'right')
Agent followed the waypoint right. (rewarded 2.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded 1.86)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.81)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 1.50)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent attempted driving forward through a red light. (rewarded -9.16)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.58)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.01)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 814
\-------------------------
Simulating trial. . .
espilon = 0.0387; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.49)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.19)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.21)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.46)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.66)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.82)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.15)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 815
\-------------------------
Simulating trial. . .
espilon = 0.0385; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.34)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.81)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.19)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.63)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.48)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.00)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.29)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.00)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.03)
36% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 816
\-------------------------
Simulating trial. . .
espilon = 0.0384; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.56)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 817
\-------------------------
Simulating trial. . .
espilon = 0.0382; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.41)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.70)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.31)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.88)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.03)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.55)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.95)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.87)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.32)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 818
\-------------------------
Simulating trial. . .
espilon = 0.0381; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.67)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.66)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.48)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.58)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.82)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.34)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.46)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 819
\-------------------------
Simulating trial. . .
espilon = 0.0379; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.53)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.25)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.64)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.57)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 820
\-------------------------
Simulating trial. . .
espilon = 0.0378; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.09)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.07)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.23)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.68)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.25)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.99)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.70)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.82)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.87)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.28)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.14)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.54)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 821
\-------------------------
Simulating trial. . .
espilon = 0.0376; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 2.32)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 1.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.88)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 822
\-------------------------
Simulating trial. . .
epsilon = 0.0375; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.35)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.75)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.96)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.10)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 823
\-------------------------
Simulating trial. . .
epsilon = 0.0373; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.59)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.78)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 824
\-------------------------
Simulating trial. . .
epsilon = 0.0372; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 2.60)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.67)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.58)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.10)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.31)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.93)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.02)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.72)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.20)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 825
\-------------------------
Simulating trial. . .
epsilon = 0.0370; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.31)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 826
\-------------------------
Simulating trial. . .
epsilon = 0.0369; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'right')
Agent drove right instead of left. (rewarded 1.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 0.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.09)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.30)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.40)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.64)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.95)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.00)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.96)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded -0.07)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.21)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.16)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.08)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.55)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 827
\-------------------------
Simulating trial. . .
epsilon = 0.0367; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.98)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 0.69)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.27)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.05)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.54)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 828
\-------------------------
Simulating trial. . .
epsilon = 0.0366; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.84)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent drove left instead of forward. (rewarded 0.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.63)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.90)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.78)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.33)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.71)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.16)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 829
\-------------------------
Simulating trial. . .
epsilon = 0.0364; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.65)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent attempted driving left through traffic and caused a minor accident. (rewarded -20.58)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.81)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.78)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.77)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.06)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.36)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.61)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.64)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.48)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.95)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.03)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.33)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.22)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.74)
51% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 830
\-------------------------
Simulating trial. . .
espilon = 0.0363; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'forward')
Agent drove forward instead of right. (rewarded 1.02)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.11)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.88)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.66)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.78)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 831
\-------------------------
Simulating trial. . .
espilon = 0.0362; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.67)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.65)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.35)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.54)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.71)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 832
\-------------------------
Simulating trial. . .
espilon = 0.0360; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.15)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.40)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.41)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.76)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.41)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.72)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.35)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.59)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.09)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.47)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.47)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 833
\-------------------------
Simulating trial. . .
espilon = 0.0359; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.02)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.18)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.20)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.64)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.97)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.11)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.20)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 834
\-------------------------
Simulating trial. . .
espilon = 0.0357; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.13)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.46)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.66)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.08)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.58)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 835
\-------------------------
Simulating trial. . .
espilon = 0.0356; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.41)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.32)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.00)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 836
\-------------------------
Simulating trial. . .
espilon = 0.0354; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.91)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.60)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.15)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.98)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.72)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.67)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.36)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.63)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.69)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 837
\-------------------------
Simulating trial. . .
espilon = 0.0353; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.54)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 838
\-------------------------
Simulating trial. . .
espilon = 0.0352; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove right instead of left. (rewarded 0.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.53)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 839
\-------------------------
Simulating trial. . .
espilon = 0.0350; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.37)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.85)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.58)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.87)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.94)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.80)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 840
\-------------------------
Simulating trial. . .
espilon = 0.0349; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.22)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.14)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.99)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.01)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.36)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.11)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.68)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.39)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.30)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.15)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 841
\-------------------------
Simulating trial. . .
espilon = 0.0347; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.84)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.28)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.78)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.02)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 842
\-------------------------
Simulating trial. . .
espilon = 0.0346; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.60)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.00)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 843
\-------------------------
Simulating trial. . .
espilon = 0.0345; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.21)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.54)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.01)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.40)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.92)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.32)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.85)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 844
\-------------------------
Simulating trial. . .
espilon = 0.0343; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.92)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.23)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.88)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 845
\-------------------------
Simulating trial. . .
espilon = 0.0342; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.37)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.65)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.84)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.44)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.47)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.68)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.53)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.25)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.37)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.89)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.12)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 846
\-------------------------
Simulating trial. . .
espilon = 0.0340; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.64)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.00)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.49)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.30)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.96)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.96)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.18)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.07)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.78)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.16)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.70)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.11)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.36)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 0.48)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.09)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.61)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 847
\-------------------------
Simulating trial. . .
espilon = 0.0339; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.60)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.34)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -10.58)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.64)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.51)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 1.43)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.53)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.21)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.23)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.25)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.42)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.71)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.72)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.43)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.99)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.76)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.15)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.39)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 848
\-------------------------
Simulating trial. . .
espilon = 0.0338; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.21)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.36)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.29)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.93)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 0.91)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.42)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.08)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 849
\-------------------------
Simulating trial. . .
espilon = 0.0336; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.53)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.50)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.22)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.24)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.70)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 850
\-------------------------
Simulating trial. . .
espilon = 0.0335; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.59)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.84)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'left')
Agent drove right instead of left. (rewarded -0.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.19)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.41)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 1.83)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.16)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.19)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.09)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 0.99)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.42)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 851
\-------------------------
Simulating trial. . .
espilon = 0.0334; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.12)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.03)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.88)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.68)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.33)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.79)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.09)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.73)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 852
\-------------------------
Simulating trial. . .
espilon = 0.0332; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.77)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.68)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.29)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove forward instead of left. (rewarded 0.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.52)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.33)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.96)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.94)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 0.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.92)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.54)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.62)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.48)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 0.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 853
\-------------------------
Simulating trial. . .
espilon = 0.0331; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent drove right instead of left. (rewarded 1.24)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.09)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.08)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.87)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.95)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.22)
77% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 854
\-------------------------
Simulating trial. . .
espilon = 0.0330; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.63)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.44)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 855
\-------------------------
Simulating trial. . .
espilon = 0.0328; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.00)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.23)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.63)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.09)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.35)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.95)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.20)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 856
\-------------------------
Simulating trial. . .
espilon = 0.0327; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.79)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.28)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.93)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.95)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.50)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.14)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.12)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.53)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.52)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.44)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.92)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 857
\-------------------------
Simulating trial. . .
espilon = 0.0326; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.75)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.72)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.31)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 1.29)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.26)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.20)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.06)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove right instead of left. (rewarded 1.55)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.55)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.13)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 1.22)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.83)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.14)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.29)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.73)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 1.02)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.37)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.30)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 0.39)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.53)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.71)
7% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 858
\-------------------------
Simulating trial. . .
espilon = 0.0325; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.62)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.91)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.55)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.60)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.71)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.87)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.14)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.83)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 859
\-------------------------
Simulating trial. . .
espilon = 0.0323; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.03)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.92)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.07)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.77)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 860
\-------------------------
Simulating trial. . .
espilon = 0.0322; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.10)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.84)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.96)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.34)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.81)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.29)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.04)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.26)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.59)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.72)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.23)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 861
\-------------------------
Simulating trial. . .
espilon = 0.0321; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.85)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.63)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.11)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.81)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent attempted driving left through traffic and cause a minor accident. (rewarded -19.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.26)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.44)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.01)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.01)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded -0.01)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 0.45)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.82)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 862
\-------------------------
Simulating trial. . .
espilon = 0.0319; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.77)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.01)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 1.12)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.85)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.47)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 863
\-------------------------
Simulating trial. . .
espilon = 0.0318; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.36)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.07)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.58)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.36)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.81)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.29)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.15)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.88)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.82)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.45)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.66)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 864
\-------------------------
Simulating trial. . .
espilon = 0.0317; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.68)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.43)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.11)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.40)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 865
\-------------------------
Simulating trial. . .
espilon = 0.0316; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.90)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.98)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.38)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.24)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent drove forward instead of right. (rewarded 1.59)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.24)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.75)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.85)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.43)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.50)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.58)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.14)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 866
\-------------------------
Simulating trial. . .
espilon = 0.0314; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', 'right')
Agent followed the waypoint right. (rewarded 2.19)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.55)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.63)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.36)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.92)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.96)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.10)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.24)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 867
\-------------------------
Simulating trial. . .
espilon = 0.0313; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.41)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.62)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 868
\-------------------------
Simulating trial. . .
espilon = 0.0312; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -9.82)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.66)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 869
\-------------------------
Simulating trial. . .
espilon = 0.0311; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.35)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.18)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.41)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.51)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 870
\-------------------------
Simulating trial. . .
espilon = 0.0309; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.23)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.31)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.66)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.86)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.70)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.49)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.27)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.31)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 871
\-------------------------
Simulating trial. . .
espilon = 0.0308; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.75)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.23)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.04)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.36)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.73)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.45)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 1.14)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 0.92)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 1.00)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.05)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.06)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.04)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 0.88)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 872
\-------------------------
Simulating trial. . .
espilon = 0.0307; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.98)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.34)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.96)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.37)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.32)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.02)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.21)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.29)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 873
\-------------------------
Simulating trial. . .
espilon = 0.0306; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.49)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 2.26)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.26)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.32)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.95)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.18)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.62)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.86)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.09)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded -0.61)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent drove forward instead of right. (rewarded -0.21)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.28)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 874
\-------------------------
Simulating trial. . .
espilon = 0.0304; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.65)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.65)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.15)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.78)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.60)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.57)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.52)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.15)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent followed the waypoint forward. (rewarded 0.96)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 875
\-------------------------
Simulating trial. . .
espilon = 0.0303; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.40)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.56)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.84)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.07)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'right')
Agent properly idled at a red light. (rewarded 1.63)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.07)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.21)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.31)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.73)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.72)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.05)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.64)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.51)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 876
\-------------------------
Simulating trial. . .
espilon = 0.0302; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.12)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.93)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.59)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.45)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.95)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.01)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.16)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 877
\-------------------------
Simulating trial. . .
espilon = 0.0301; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.61)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.88)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving left through a red light. (rewarded -10.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.68)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 878
\-------------------------
Simulating trial. . .
espilon = 0.0300; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.10)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.42)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.93)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.87)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.74)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 879
\-------------------------
Simulating trial. . .
espilon = 0.0298; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.62)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.40)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.30)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.46)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.07)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.60)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.69)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.11)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.56)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.99)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.07)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.02)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 880
\-------------------------
Simulating trial. . .
espilon = 0.0297; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.64)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.00)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.14)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.41)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.74)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.91)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.22)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.80)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.53)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.49)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.55)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.37)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.23)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded -0.14)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.16)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.73)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.76)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.99)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.53)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.93)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.58)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.75)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 1.09)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.37)
11% of time remaining to reach destination.
/-------------------
| Step 31 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.65)
9% of time remaining to reach destination.
/-------------------
| Step 32 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded -0.23)
6% of time remaining to reach destination.
/-------------------
| Step 33 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', None)
Agent drove forward instead of left. (rewarded 0.56)
3% of time remaining to reach destination.
/-------------------
| Step 34 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 881
\-------------------------
Simulating trial. . .
espilon = 0.0296; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.90)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.36)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent drove forward instead of left. (rewarded 0.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.20)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.29)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 882
\-------------------------
Simulating trial. . .
espilon = 0.0295; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 1.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.41)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.39)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.69)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 883
\-------------------------
Simulating trial. . .
espilon = 0.0294; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.49)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.99)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.44)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.71)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent attempted driving forward through a red light. (rewarded -10.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.71)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.30)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.85)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.28)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.61)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.01)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.21)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.05)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 0.79)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.83)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.58)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.26)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.50)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.21)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.64)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 884
\-------------------------
Simulating trial. . .
espilon = 0.0292; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'right')
Agent drove right instead of forward. (rewarded 0.15)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.87)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.32)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 885
\-------------------------
Simulating trial. . .
espilon = 0.0291; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.07)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.99)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.52)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.21)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.98)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.77)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.62)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 886
\-------------------------
Simulating trial. . .
espilon = 0.0290; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.89)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.21)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.22)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.12)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 887
\-------------------------
Simulating trial. . .
espilon = 0.0289; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.71)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.09)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent drove right instead of forward. (rewarded 1.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.57)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.62)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove left instead of forward. (rewarded 1.61)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.71)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.61)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.21)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.44)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.06)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.83)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.92)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.02)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 888
\-------------------------
Simulating trial. . .
espilon = 0.0288; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 1.71)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.84)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.17)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.70)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.97)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.72)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.36)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.39)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.23)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.84)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 889
\-------------------------
Simulating trial. . .
espilon = 0.0287; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.57)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.91)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 890
\-------------------------
Simulating trial. . .
espilon = 0.0286; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.19)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.61)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.98)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.27)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.25)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.69)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 891
\-------------------------
Simulating trial. . .
espilon = 0.0284; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.10)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.79)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.06)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.37)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.09)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.29)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.77)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.69)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.84)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.15)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 892
\-------------------------
Simulating trial. . .
espilon = 0.0283; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 1.14)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.73)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.13)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', 'forward')
Agent followed the waypoint right. (rewarded 1.20)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.88)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.47)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.83)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.17)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.11)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.31)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.98)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.99)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 893
\-------------------------
Simulating trial. . .
espilon = 0.0282; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.33)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.73)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 1.59)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.76)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.54)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.20)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.47)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.23)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.08)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -9.64)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.67)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.23)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 1.19)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 894
\-------------------------
Simulating trial. . .
espilon = 0.0281; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.48)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.72)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.26)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.55)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.40)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.08)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.43)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 895
\-------------------------
Simulating trial. . .
espilon = 0.0280; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.56)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.78)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.22)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.79)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.66)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.44)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 0.45)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.62)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.80)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.90)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.67)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.71)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.64)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.23)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.83)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'forward')
Agent drove forward instead of left. (rewarded 0.71)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.92)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.14)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 0.94)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 896
\-------------------------
Simulating trial. . .
espilon = 0.0279; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.64)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 1.23)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.08)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.56)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 2.04)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.52)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.91)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.49)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.68)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.06)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.32)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.82)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.72)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.47)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.72)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 0.66)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.25)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.99)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 897
\-------------------------
Simulating trial. . .
espilon = 0.0278; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.68)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.44)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'left')
Agent followed the waypoint forward. (rewarded 0.93)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.79)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.50)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 898
\-------------------------
Simulating trial. . .
espilon = 0.0277; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.47)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.76)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.93)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.43)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded -0.01)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.23)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.35)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.78)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded -0.18)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.14)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 899
\-------------------------
Simulating trial. . .
espilon = 0.0275; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'right')
Agent drove right instead of left. (rewarded 0.62)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.56)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.97)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.39)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.70)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.22)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 0.63)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.65)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.55)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.84)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 900
\-------------------------
Simulating trial. . .
espilon = 0.0274; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.55)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.82)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.11)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.09)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.35)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.03)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 901
\-------------------------
Simulating trial. . .
espilon = 0.0273; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.88)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.04)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.98)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.77)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 902
\-------------------------
Simulating trial. . .
espilon = 0.0272; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.09)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 0.68)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.47)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.87)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 0.96)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.84)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.39)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.47)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.30)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.90)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.64)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 1.75)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.82)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.76)
8% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 903
\-------------------------
Simulating trial. . .
espilon = 0.0271; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.12)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.70)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.38)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.35)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.24)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.34)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.23)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.34)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.96)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.07)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 904
\-------------------------
Simulating trial. . .
espilon = 0.0270; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.09)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.36)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.61)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.53)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.44)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.36)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.10)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.17)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.77)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.26)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 0.39)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.23)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 905
\-------------------------
Simulating trial. . .
espilon = 0.0269; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.09)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.90)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.84)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.53)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.00)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.62)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.10)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.63)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.70)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 906
\-------------------------
Simulating trial. . .
espilon = 0.0268; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 0.51)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.41)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.88)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.67)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 907
\-------------------------
Simulating trial. . .
espilon = 0.0267; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.85)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.52)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.12)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.10)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 908
\-------------------------
Simulating trial. . .
espilon = 0.0266; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'right')
Agent drove right instead of left. (rewarded 0.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.72)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.64)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.16)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 909
\-------------------------
Simulating trial. . .
espilon = 0.0265; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.35)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.58)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.51)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.40)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 910
\-------------------------
Simulating trial. . .
espilon = 0.0264; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.01)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.28)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.23)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 1.12)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.77)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 0.94)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.30)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.27)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.55)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 911
\-------------------------
Simulating trial. . .
espilon = 0.0263; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 2.09)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.04)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.24)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.92)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.14)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.10)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.79)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.61)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.67)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.57)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.65)
54% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 912
\-------------------------
Simulating trial. . .
espilon = 0.0261; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.91)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.92)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.11)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 913
\-------------------------
Simulating trial. . .
espilon = 0.0260; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent drove left instead of right. (rewarded 0.59)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -10.44)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.28)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.78)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.32)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.80)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 0.91)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.95)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 914
\-------------------------
Simulating trial. . .
espilon = 0.0259; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.60)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.17)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.52)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.62)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent drove right instead of forward. (rewarded 1.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.20)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.48)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.33)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 915
\-------------------------
Simulating trial. . .
espilon = 0.0258; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.16)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.05)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded -0.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.57)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.02)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.14)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.26)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 916
\-------------------------
Simulating trial. . .
espilon = 0.0257; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.08)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 2.59)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.92)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.56)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.63)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.99)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.98)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.05)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.58)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.74)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 917
\-------------------------
Simulating trial. . .
espilon = 0.0256; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.82)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.28)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.26)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.85)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.52)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.02)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 918
\-------------------------
Simulating trial. . .
espilon = 0.0255; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.30)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.86)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.69)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.26)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.62)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.98)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 919
\-------------------------
Simulating trial. . .
espilon = 0.0254; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.90)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.75)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent drove right instead of forward. (rewarded 1.72)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.84)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.63)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.62)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.65)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.45)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'right', None)
Agent properly idled at a red light. (rewarded 0.82)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 0.89)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.48)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 920
\-------------------------
Simulating trial. . .
espilon = 0.0253; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.96)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.85)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.07)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.83)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.48)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.02)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.28)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.98)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.70)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 921
\-------------------------
Simulating trial. . .
espilon = 0.0252; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.59)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.08)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.41)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.32)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 922
\-------------------------
Simulating trial. . .
espilon = 0.0251; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.71)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.35)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.90)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.81)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.51)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.96)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.71)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.91)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.46)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.17)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 923
\-------------------------
Simulating trial. . .
epsilon = 0.0250; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.05)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.93)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.52)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 924
\-------------------------
Simulating trial. . .
epsilon = 0.0249; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.76)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.30)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.61)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 925
\-------------------------
Simulating trial. . .
epsilon = 0.0248; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.51)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.68)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.48)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.58)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.28)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.79)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.55)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.32)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.54)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.03)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.35)
33% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 926
\-------------------------
Simulating trial. . .
epsilon = 0.0247; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.45)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.96)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.16)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.88)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 927
\-------------------------
Simulating trial. . .
epsilon = 0.0246; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.41)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.19)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.73)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.38)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 928
\-------------------------
Simulating trial. . .
epsilon = 0.0245; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.17)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.59)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.59)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.83)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.98)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 929
\-------------------------
Simulating trial. . .
epsilon = 0.0244; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', None)
Agent followed the waypoint forward. (rewarded 2.77)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.57)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.58)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.59)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.63)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.41)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.08)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 930
\-------------------------
Simulating trial. . .
epsilon = 0.0243; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.39)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.37)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.87)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 0.93)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.70)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 931
\-------------------------
Simulating trial. . .
epsilon = 0.0242; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.63)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.56)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.27)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.76)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.84)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.86)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.49)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.16)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.29)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.92)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.39)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 932
\-------------------------
Simulating trial. . .
espilon = 0.0241; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.98)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.69)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.51)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.84)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded -0.30)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.40)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.74)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.57)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 933
\-------------------------
Simulating trial. . .
espilon = 0.0240; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.85)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.55)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.38)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.06)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.02)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.17)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 0.95)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'right')
Agent drove right instead of left. (rewarded -0.03)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.23)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.45)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.37)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.83)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.58)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.36)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 934
\-------------------------
Simulating trial. . .
espilon = 0.0239; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.75)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.56)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.22)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.60)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.77)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 935
\-------------------------
Simulating trial. . .
espilon = 0.0238; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.66)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.67)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', None)
Agent followed the waypoint right. (rewarded 2.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.52)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.42)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 936
\-------------------------
Simulating trial. . .
espilon = 0.0238; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.58)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.22)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -40.80)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.54)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.87)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 0.75)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.27)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.03)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.13)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 0.65)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'right')
Agent drove forward instead of right. (rewarded -0.16)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.12)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.98)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 937
\-------------------------
Simulating trial. . .
espilon = 0.0237; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.14)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.34)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.44)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 938
\-------------------------
Simulating trial. . .
espilon = 0.0236; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.63)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.55)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 1.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.71)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 939
\-------------------------
Simulating trial. . .
espilon = 0.0235; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.42)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.56)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.70)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 940
\-------------------------
Simulating trial. . .
espilon = 0.0234; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.35)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.23)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.39)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.48)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.48)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.42)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 941
\-------------------------
Simulating trial. . .
espilon = 0.0233; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.96)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.97)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.97)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.77)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.67)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.26)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.27)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.91)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 942
\-------------------------
Simulating trial. . .
espilon = 0.0232; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.71)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 943
\-------------------------
Simulating trial. . .
espilon = 0.0231; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.05)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.73)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.81)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.83)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.25)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.64)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded 1.60)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.52)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.21)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.54)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.32)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.27)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.69)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 944
\-------------------------
Simulating trial. . .
espilon = 0.0230; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.80)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.90)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.39)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.87)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.02)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.58)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 945
\-------------------------
Simulating trial. . .
espilon = 0.0229; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.75)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.80)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.08)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.29)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.20)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.29)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 946
\-------------------------
Simulating trial. . .
espilon = 0.0228; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'forward')
Agent drove forward instead of right. (rewarded 1.89)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 2.57)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.79)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 947
\-------------------------
Simulating trial. . .
espilon = 0.0227; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.20)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.81)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.14)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -39.83)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.51)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.58)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.55)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.85)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.49)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.04)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.38)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 948
\-------------------------
Simulating trial. . .
espilon = 0.0226; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.11)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.61)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.36)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent attempted driving left through a red light. (rewarded -10.57)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.09)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.05)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.85)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 949
\-------------------------
Simulating trial. . .
espilon = 0.0226; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.85)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.81)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.23)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'right')
Agent followed the waypoint forward. (rewarded 1.61)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 950
\-------------------------
Simulating trial. . .
espilon = 0.0225; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.51)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.34)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.88)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.95)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.45)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.47)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.59)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.18)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.24)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.00)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.40)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.09)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.48)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.74)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 1.31)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.04)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.40)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.02)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 951
\-------------------------
Simulating trial. . .
espilon = 0.0224; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.21)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.43)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.87)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.85)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.89)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.28)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.33)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.95)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.24)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.38)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.32)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.28)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.08)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.01)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 952
\-------------------------
Simulating trial. . .
espilon = 0.0223; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.13)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 1.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.62)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 0.65)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'right')
Agent drove right instead of left. (rewarded 0.84)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent drove left instead of right. (rewarded 1.56)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.17)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.43)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.25)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.67)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.76)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.47)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.08)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.55)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 953
\-------------------------
Simulating trial. . .
espilon = 0.0222; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.31)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.00)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.60)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', 'left')
Agent drove forward instead of left. (rewarded 0.81)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 1.28)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'right', 'left')
Agent followed the waypoint right. (rewarded 1.23)
73% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 954
\-------------------------
Simulating trial. . .
espilon = 0.0221; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving left through a red light. (rewarded -9.90)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.91)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.67)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.43)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.23)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.77)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.18)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.42)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.56)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.46)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 955
\-------------------------
Simulating trial. . .
espilon = 0.0220; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 1.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.35)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.02)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.40)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.58)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.75)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.49)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.30)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 956
\-------------------------
Simulating trial. . .
espilon = 0.0219; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.78)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.26)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.19)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.25)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.18)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 0.91)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.65)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 1.13)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.50)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 0.69)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.34)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 957
\-------------------------
Simulating trial. . .
espilon = 0.0218; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.64)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent followed the waypoint right. (rewarded 1.93)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.24)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.36)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 958
\-------------------------
Simulating trial. . .
espilon = 0.0218; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.28)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 1.97)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.83)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.88)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.31)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 959
\-------------------------
Simulating trial. . .
espilon = 0.0217; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.40)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.76)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.83)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.93)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.57)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.54)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.27)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.62)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.68)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.66)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 960
\-------------------------
Simulating trial. . .
espilon = 0.0216; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.00)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.86)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.80)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.73)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.26)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.25)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.70)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 961
\-------------------------
Simulating trial. . .
espilon = 0.0215; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.00)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.47)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.62)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.96)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.68)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.61)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.66)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.62)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 962
\-------------------------
Simulating trial. . .
espilon = 0.0214; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.30)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.27)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.78)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.98)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.50)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 963
\-------------------------
Simulating trial. . .
espilon = 0.0213; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.61)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.14)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.88)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.16)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.79)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.84)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.48)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.72)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 964
\-------------------------
Simulating trial. . .
espilon = 0.0212; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.39)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.79)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.19)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.61)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.99)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 965
\-------------------------
Simulating trial. . .
espilon = 0.0212; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'left')
Agent drove forward instead of left. (rewarded 1.72)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.44)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.71)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.60)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 1.18)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent attempted driving left through a red light. (rewarded -10.52)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.92)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.96)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.71)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.86)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -4.89)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.13)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 0.22)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.53)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 966
\-------------------------
Simulating trial. . .
espilon = 0.0211; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.88)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.35)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.84)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 967
\-------------------------
Simulating trial. . .
espilon = 0.0210; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.04)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.10)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.50)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.04)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 968
\-------------------------
Simulating trial. . .
espilon = 0.0209; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.84)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.44)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.85)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'right')
Agent followed the waypoint forward. (rewarded 2.58)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.89)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.80)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.89)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.41)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.90)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.41)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 969
\-------------------------
Simulating trial. . .
espilon = 0.0208; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.48)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.10)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.14)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.02)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.46)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.73)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.53)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.60)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.82)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.61)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.62)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.49)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 0.23)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 970
\-------------------------
Simulating trial. . .
espilon = 0.0207; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.52)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.75)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.05)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.85)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.16)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.68)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.22)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 971
\-------------------------
Simulating trial. . .
espilon = 0.0207; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.89)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.89)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.42)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.57)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.19)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.59)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.95)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.12)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 972
\-------------------------
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
Simulating trial. . .
espilon = 0.0206; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.84)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.36)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.27)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded -0.15)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.64)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.30)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.43)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.65)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.68)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.17)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.99)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.41)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.22)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 973
\-------------------------
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
Simulating trial. . .
espilon = 0.0205; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.16)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.99)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.80)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.04)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.36)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded -0.11)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.42)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.53)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 974
\-------------------------
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
Simulating trial. . .
espilon = 0.0204; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.19)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.62)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.95)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 975
\-------------------------
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
Simulating trial. . .
espilon = 0.0203; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.36)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.97)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.68)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.99)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.79)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 976
\-------------------------
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.87)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.36)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.66)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.64)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.84)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.20)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.25)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.65)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.38)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'right')
Agent drove forward instead of right. (rewarded 0.25)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.11)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.27)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.24)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.10)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 977
\-------------------------
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
Simulating trial. . .
espilon = 0.0202; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.95)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.96)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.99)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.08)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.69)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.27)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.34)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.83)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.11)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.18)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'forward')
Agent drove forward instead of right. (rewarded -0.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.82)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.17)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.13)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 978
\-------------------------
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
Simulating trial. . .
espilon = 0.0201; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.39)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove right instead of forward. (rewarded 0.99)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.93)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 1.25)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.19)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.57)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.76)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.97)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.29)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.68)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 979
\-------------------------
Simulating trial. . .
espilon = 0.0200; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.70)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.69)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.60)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.57)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.13)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.59)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', None)
Agent followed the waypoint left. (rewarded 0.94)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.51)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.54)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', 'right')
Agent attempted driving forward through a red light. (rewarded -10.70)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.12)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.07)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.44)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.36)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.28)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.33)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 980
\-------------------------
Simulating trial. . .
espilon = 0.0199; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.14)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.84)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 2.34)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.72)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.40)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.52)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.36)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 981
\-------------------------
Simulating trial. . .
espilon = 0.0198; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.00)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.15)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.02)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 982
\-------------------------
Simulating trial. . .
espilon = 0.0198; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 1.35)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.17)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.90)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.08)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.37)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.83)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.60)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.91)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.57)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 0.86)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.90)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.19)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.65)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.22)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 0.84)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.73)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.02)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.69)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.72)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.38)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 983
\-------------------------
Simulating trial. . .
espilon = 0.0197; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.51)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.25)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.30)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.11)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.88)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.70)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.01)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.77)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.65)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 984
\-------------------------
Simulating trial. . .
espilon = 0.0196; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.75)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.19)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.06)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.13)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 0.16)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.24)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.94)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', None)
Agent followed the waypoint right. (rewarded 2.00)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.12)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.08)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.54)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 985
\-------------------------
Simulating trial. . .
espilon = 0.0195; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent drove right instead of left. (rewarded 0.62)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'right')
Agent followed the waypoint right. (rewarded 2.25)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.28)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.28)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.94)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.18)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.91)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.23)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.27)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.02)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.40)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.04)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -9.65)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.81)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 986
\-------------------------
Simulating trial. . .
espilon = 0.0194; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'right')
Agent attempted driving forward through a red light with traffic and cause a major accident. (rewarded -39.85)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.22)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'right', 'left')
Agent drove right instead of forward. (rewarded 0.68)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.44)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.44)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 987
\-------------------------
Simulating trial. . .
espilon = 0.0194; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.94)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.61)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.46)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.54)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.57)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 2.20)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.62)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.81)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.88)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.17)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.23)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.50)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.54)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 1.12)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.08)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.16)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.19)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.50)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.43)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.34)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.80)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.57)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.42)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 988
\-------------------------
Simulating trial. . .
espilon = 0.0193; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.75)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.97)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.03)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 989
\-------------------------
Simulating trial. . .
espilon = 0.0192; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.88)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.82)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.40)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.28)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.38)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.20)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.42)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.02)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.41)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 990
\-------------------------
Simulating trial. . .
espilon = 0.0191; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.26)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.87)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'left')
Agent followed the waypoint right. (rewarded 2.57)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 991
\-------------------------
Simulating trial. . .
espilon = 0.0191; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.17)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.39)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.71)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.74)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 0.14)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.95)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.23)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.27)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.98)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.44)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 992
\-------------------------
Simulating trial. . .
espilon = 0.0190; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', 'left')
Agent drove forward instead of left. (rewarded 1.35)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.92)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.05)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.33)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.24)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.04)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.88)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.29)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.05)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.26)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.74)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.97)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 993
\-------------------------
Simulating trial. . .
espilon = 0.0189; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.32)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.47)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.59)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.41)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.75)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.73)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'left', None)
Agent drove forward instead of left. (rewarded 0.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.18)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.18)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.49)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.29)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 994
\-------------------------
Simulating trial. . .
espilon = 0.0188; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.49)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.28)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 1.34)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.16)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.99)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.95)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.28)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.24)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.31)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.78)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.75)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.19)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.24)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 995
\-------------------------
Simulating trial. . .
espilon = 0.0188; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.63)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.42)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.70)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.62)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.84)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.82)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 1.37)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.57)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.78)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.90)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.82)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.40)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.58)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.40)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.77)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.11)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.86)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.99)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.04)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 996
\-------------------------
Simulating trial. . .
espilon = 0.0187; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.22)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.13)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.41)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.34)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.02)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.76)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.94)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'right', 'forward')
Agent drove right instead of left. (rewarded 0.02)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent attempted driving forward through a red light. (rewarded -10.11)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.20)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.28)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.45)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.40)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.66)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 997
\-------------------------
Simulating trial. . .
espilon = 0.0186; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'left')
Agent followed the waypoint right. (rewarded 2.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.72)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.37)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 998
\-------------------------
Simulating trial. . .
espilon = 0.0185; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 2.26)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.62)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.15)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.59)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 999
\-------------------------
Simulating trial. . .
espilon = 0.0185; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.75)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.99)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.56)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.28)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.92)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.45)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.69)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.23)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1000
\-------------------------
Simulating trial. . .
espilon = 0.0184; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 1.24)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent attempted driving forward through a red light. (rewarded -10.89)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent drove right instead of left. (rewarded 0.18)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 1.19)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.09)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.81)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.50)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.60)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 0.82)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.66)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.02)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.62)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.70)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1001
\-------------------------
Simulating trial. . .
epsilon = 0.0183; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.07)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.46)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.20)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.64)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.22)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.99)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.00)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.36)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.83)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1002
\-------------------------
Simulating trial. . .
epsilon = 0.0182; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.70)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.21)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.62)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.48)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.83)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.57)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded -0.14)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.15)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.19)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1003
\-------------------------
Simulating trial. . .
epsilon = 0.0182; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.88)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.14)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.43)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.87)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'right', None)
Agent followed the waypoint right. (rewarded 2.01)
77% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1004
\-------------------------
Simulating trial. . .
epsilon = 0.0181; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 1.39)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.19)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.61)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.14)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.21)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.14)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.02)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.66)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.65)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.65)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.86)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.46)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
51% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1005
\-------------------------
Simulating trial. . .
epsilon = 0.0180; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'right')
Agent followed the waypoint right. (rewarded 2.56)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.95)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.39)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.65)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.94)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.25)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.10)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.36)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1006
\-------------------------
Simulating trial. . .
epsilon = 0.0180; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.03)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.20)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.37)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.04)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.57)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.43)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.22)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.67)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.96)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.11)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 0.95)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded 0.58)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.90)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.83)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.82)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1007
\-------------------------
Simulating trial. . .
epsilon = 0.0179; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.12)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.66)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 2.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.15)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.53)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.81)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.67)
72% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1008
\-------------------------
Simulating trial. . .
epsilon = 0.0178; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.08)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.05)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.83)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.77)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.44)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1009
\-------------------------
Simulating trial. . .
epsilon = 0.0177; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.04)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.58)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.36)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.16)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.48)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.46)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.57)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.19)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.04)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.91)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1010
\-------------------------
Simulating trial. . .
espilon = 0.0177; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.11)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.93)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.99)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.96)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.22)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.79)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1011
\-------------------------
Simulating trial. . .
espilon = 0.0176; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.74)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.18)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.49)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.60)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.91)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.36)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.22)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.34)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.50)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1012
\-------------------------
Simulating trial. . .
espilon = 0.0175; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.13)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.92)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.34)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.50)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.52)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.07)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.84)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.79)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1013
\-------------------------
Simulating trial. . .
espilon = 0.0175; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.19)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'left')
Agent followed the waypoint right. (rewarded 2.65)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.56)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.31)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.57)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.27)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.65)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.69)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.70)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.92)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent drove right instead of left. (rewarded 0.53)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.84)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.76)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'right')
Agent attempted driving left through a red light with traffic and cause a major accident. (rewarded -40.01)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 0.85)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.00)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.73)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.40)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.75)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 0.32)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.81)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.58)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1014
\-------------------------
Simulating trial. . .
espilon = 0.0174; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.43)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.35)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.49)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.74)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.89)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.48)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 1.91)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.67)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.88)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.79)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1015
\-------------------------
Simulating trial. . .
espilon = 0.0173; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'right')
Agent followed the waypoint right. (rewarded 1.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.42)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.67)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.11)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.66)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.94)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1016
\-------------------------
Simulating trial. . .
espilon = 0.0172; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 1.82)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.30)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'left')
Agent properly idled at a red light. (rewarded 1.85)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.82)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.74)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.92)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 1.87)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.97)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.05)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.71)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.12)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.68)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1017
\-------------------------
Simulating trial. . .
espilon = 0.0172; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.02)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.12)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent attempted driving left through a red light. (rewarded -10.46)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.87)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 0.94)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.00)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.25)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.74)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.25)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.14)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.23)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.91)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.57)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.09)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1018
\-------------------------
Simulating trial. . .
espilon = 0.0171; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.24)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.45)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.67)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.52)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.64)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.86)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 1.18)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.01)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.24)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.79)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1019
\-------------------------
Simulating trial. . .
espilon = 0.0170; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.08)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.36)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.86)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.47)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.23)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.04)
72% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1020
\-------------------------
Simulating trial. . .
espilon = 0.0170; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.36)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.88)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.84)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.76)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.36)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.26)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 1.81)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.41)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.97)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.02)
47% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1021
\-------------------------
Simulating trial. . .
espilon = 0.0169; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.36)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.14)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.46)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.91)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.51)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.36)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.67)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.90)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1022
\-------------------------
Simulating trial. . .
espilon = 0.0168; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.92)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.88)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.85)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.14)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.80)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.58)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.91)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.86)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.48)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.29)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.80)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.35)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 0.67)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.84)
16% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1023
\-------------------------
Simulating trial. . .
espilon = 0.0168; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.25)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.01)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.42)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.62)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.78)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.91)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.82)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1024
\-------------------------
Simulating trial. . .
espilon = 0.0167; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.46)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.99)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent drove right instead of left. (rewarded 1.27)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.63)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.12)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.36)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.60)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.78)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.89)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.25)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.43)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.97)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.35)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.65)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1025
\-------------------------
Simulating trial. . .
espilon = 0.0166; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.22)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.71)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.17)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.20)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.72)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1026
\-------------------------
Simulating trial. . .
espilon = 0.0166; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.04)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.43)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 1.42)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.20)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.65)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1027
\-------------------------
Simulating trial. . .
espilon = 0.0165; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.79)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.72)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.32)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.01)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.45)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.36)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.78)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.57)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.42)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.80)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1028
\-------------------------
Simulating trial. . .
espilon = 0.0164; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.12)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.15)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.62)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.74)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded -0.20)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.22)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.46)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.03)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.88)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 0.50)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.13)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.93)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1029
\-------------------------
Simulating trial. . .
espilon = 0.0164; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.06)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.71)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.48)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.00)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.26)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.53)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.17)
77% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1030
\-------------------------
Simulating trial. . .
espilon = 0.0163; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.02)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.71)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.38)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.87)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.58)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.69)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.63)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.01)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.77)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.61)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.84)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.24)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.23)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.95)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.66)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.72)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1031
\-------------------------
Simulating trial. . .
espilon = 0.0162; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 0.87)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.86)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.98)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.83)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.47)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.19)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.34)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.55)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.85)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.80)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.26)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.41)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.48)
46% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1032
\-------------------------
Simulating trial. . .
espilon = 0.0162; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.72)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.96)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.89)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.86)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.00)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.71)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.03)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.45)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.58)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.31)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.53)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.17)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.18)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.40)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.60)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.91)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1033
\-------------------------
Simulating trial. . .
espilon = 0.0161; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.53)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.62)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.47)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.68)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.31)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1034
\-------------------------
Simulating trial. . .
espilon = 0.0161; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.81)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.23)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.80)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.83)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.10)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.03)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.16)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1035
\-------------------------
Simulating trial. . .
espilon = 0.0160; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.23)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.10)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.87)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 2.02)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded 0.52)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 2.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.29)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.47)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.72)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1036
\-------------------------
Simulating trial. . .
espilon = 0.0159; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.68)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.72)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.74)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.29)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.49)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.11)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.84)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.52)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1037
\-------------------------
Simulating trial. . .
espilon = 0.0159; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.24)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.50)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.41)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.88)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.45)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.05)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.91)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.28)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.74)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 2.05)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1038
\-------------------------
Simulating trial. . .
espilon = 0.0158; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.54)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.81)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.12)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.42)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.10)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1039
\-------------------------
Simulating trial. . .
espilon = 0.0157; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.14)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.01)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.19)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.38)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'right')
Agent properly idled at a red light. (rewarded 2.26)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.68)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.06)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1040
\-------------------------
Simulating trial. . .
espilon = 0.0157; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.87)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.25)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.67)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.78)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.42)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.84)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.21)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.31)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded -0.13)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.43)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.08)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'right', None)
Agent drove right instead of left. (rewarded 0.94)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded -0.45)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.45)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.38)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.18)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.15)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1041
\-------------------------
Simulating trial. . .
espilon = 0.0156; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.43)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.60)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 2.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.72)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.66)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.05)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.71)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.25)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.75)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.40)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.11)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.04)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 0.74)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1042
\-------------------------
Simulating trial. . .
espilon = 0.0155; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.22)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.84)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.65)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.48)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.46)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.03)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.65)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.16)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.43)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1043
\-------------------------
Simulating trial. . .
espilon = 0.0155; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.40)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.46)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.21)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.20)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.63)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.86)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.96)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.73)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'right', None)
Agent drove right instead of forward. (rewarded 1.20)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.01)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.14)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.66)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded -0.41)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.06)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.04)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 0.89)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1044
\-------------------------
Simulating trial. . .
espilon = 0.0154; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.86)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.86)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.59)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.49)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.88)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.12)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.12)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.79)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.76)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.74)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.25)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.27)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent drove right instead of left. (rewarded 0.68)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.49)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.97)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.51)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.38)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.63)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.04)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1045
\-------------------------
Simulating trial. . .
espilon = 0.0154; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.05)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.61)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.47)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.85)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.36)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.80)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.49)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.14)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1046
\-------------------------
Simulating trial. . .
espilon = 0.0153; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 2.04)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.00)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.37)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.54)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.97)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1047
\-------------------------
Simulating trial. . .
espilon = 0.0152; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.01)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.70)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.48)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.40)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.21)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.10)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1048
\-------------------------
Simulating trial. . .
espilon = 0.0152; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.07)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.10)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.38)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.77)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.78)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.02)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.82)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.25)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.41)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1049
\-------------------------
Simulating trial. . .
espilon = 0.0151; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.00)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.36)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.12)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.92)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.59)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.63)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.61)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 1.86)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.40)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.19)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.81)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.19)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.94)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.21)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.25)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1050
\-------------------------
Simulating trial. . .
espilon = 0.0151; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.80)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.30)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.47)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.41)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.43)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.94)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.28)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.21)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.41)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1051
\-------------------------
Simulating trial. . .
espilon = 0.0150; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 2.77)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.88)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.91)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.16)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.91)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.37)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.54)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.96)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.27)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.69)
63% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1052
\-------------------------
Simulating trial. . .
espilon = 0.0149; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.75)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.20)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.79)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.93)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.99)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.86)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.77)
67% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1053
\-------------------------
Simulating trial. . .
espilon = 0.0149; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.99)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.97)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.34)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.71)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.85)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'left')
Agent drove right instead of forward. (rewarded 1.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.19)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.28)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.79)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.08)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.65)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 1.02)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.63)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.08)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.27)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.25)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1054
\-------------------------
Simulating trial. . .
espilon = 0.0148; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.75)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.58)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.88)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.48)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.62)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.30)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1055
\-------------------------
Simulating trial. . .
espilon = 0.0148; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'forward', None)
Agent drove left instead of right. (rewarded 1.01)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.83)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.26)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.65)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.54)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.03)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.44)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.93)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.72)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.44)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1056
\-------------------------
Simulating trial. . .
espilon = 0.0147; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.32)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.54)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.94)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.64)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 2.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.83)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1057
\-------------------------
Simulating trial. . .
espilon = 0.0146; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.58)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.21)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.12)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.46)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.09)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.62)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.28)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.34)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.10)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.72)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 1.24)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.22)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.09)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.39)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.54)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'right')
Agent followed the waypoint right. (rewarded 1.12)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.51)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 0.83)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.23)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 2.17)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.76)
17% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.86)
14% of time remaining to reach destination.
/-------------------
| Step 30 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.26)
11% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1058
\-------------------------
Simulating trial. . .
espilon = 0.0146; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 2.67)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.51)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.30)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.49)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1059
\-------------------------
Simulating trial. . .
espilon = 0.0145; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.00)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.88)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.68)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.17)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.66)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.39)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent attempted driving forward through a red light. (rewarded -10.31)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.70)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.54)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.57)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.14)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.58)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.95)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.77)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.29)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 1.11)
27% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.93)
23% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.64)
20% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.29)
17% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
13% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.85)
10% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded -0.47)
7% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.25)
3% of time remaining to reach destination.
/-------------------
| Step 29 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.79)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1060
\-------------------------
Simulating trial. . .
espilon = 0.0145; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.52)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.16)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.61)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 0.99)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.38)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'right')
Agent drove right instead of forward. (rewarded 0.39)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.38)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.20)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.85)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.60)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.09)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.64)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.65)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.62)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.05)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.01)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.45)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 0.88)
8% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1061
\-------------------------
Simulating trial. . .
espilon = 0.0144; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.41)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.70)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.13)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.50)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.36)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1062
\-------------------------
Simulating trial. . .
espilon = 0.0144; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.95)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.06)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.72)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.52)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.95)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.60)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.99)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.14)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.99)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.72)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'forward')
Agent followed the waypoint forward. (rewarded 1.62)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1063
\-------------------------
Simulating trial. . .
espilon = 0.0143; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.14)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 1.12)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.30)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.91)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.57)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.40)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.77)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'right', 'right')
Agent drove forward instead of right. (rewarded 0.35)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.35)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.26)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.97)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.65)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.76)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 0.66)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.14)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.78)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.37)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.94)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.22)
12% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.83)
8% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'right')
Agent drove right instead of left. (rewarded 0.57)
4% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.81)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1064
\-------------------------
Simulating trial. . .
espilon = 0.0142; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.20)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.42)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.77)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.74)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.76)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.77)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.70)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1065
\-------------------------
Simulating trial. . .
espilon = 0.0142; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.57)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.28)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.43)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.60)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.23)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.44)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.49)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.09)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.00)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1066
\-------------------------
Simulating trial. . .
espilon = 0.0141; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.17)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.33)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.75)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'right')
Agent drove right instead of forward. (rewarded 1.64)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.39)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.37)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.67)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.62)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.24)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'left')
Agent followed the waypoint right. (rewarded 2.09)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.45)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.40)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.55)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1067
\-------------------------
Simulating trial. . .
espilon = 0.0141; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.18)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.38)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.91)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.46)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.79)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.12)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 0.99)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.45)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.74)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.55)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.91)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.24)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.88)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.59)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.22)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1068
\-------------------------
Simulating trial. . .
espilon = 0.0140; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.48)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.81)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.11)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.19)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.90)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1069
\-------------------------
Simulating trial. . .
espilon = 0.0140; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.74)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.90)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.07)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.21)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.05)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.12)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.60)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.08)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.97)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.65)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded -0.27)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.07)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.80)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.48)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1070
\-------------------------
Simulating trial. . .
espilon = 0.0139; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.86)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.61)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.79)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 2.16)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.25)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 2.30)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.00)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.91)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.75)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove right instead of forward. (rewarded 0.16)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.17)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.36)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.37)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.92)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.57)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.66)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1071
\-------------------------
Simulating trial. . .
espilon = 0.0138; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.01)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.40)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.89)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 1.93)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.70)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.77)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent idled at a green light with no oncoming traffic. (rewarded -5.73)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.30)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.73)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.62)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.18)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.78)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.52)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.50)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.76)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 1.44)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1072
\-------------------------
Simulating trial. . .
espilon = 0.0138; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.69)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 0.21)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.78)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.85)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.45)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.13)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.88)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.90)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.91)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.56)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.87)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.65)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.21)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.32)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.75)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 0.41)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'right')
Agent drove right instead of left. (rewarded -0.58)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.45)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 0.26)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1073
\-------------------------
Simulating trial. . .
espilon = 0.0137; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.61)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.58)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.14)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 0.45)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.13)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.20)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.27)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'left')
Agent followed the waypoint forward. (rewarded 1.59)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.99)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.60)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.87)
25% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1074
\-------------------------
Simulating trial. . .
espilon = 0.0137; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.83)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.65)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.82)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.04)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 0.99)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.77)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1075
\-------------------------
Simulating trial. . .
espilon = 0.0136; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.68)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.66)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.56)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.95)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.10)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.69)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.03)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.44)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.07)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.60)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.14)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'left')
Agent followed the waypoint forward. (rewarded 1.75)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1076
\-------------------------
Simulating trial. . .
espilon = 0.0136; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.68)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.05)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.49)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, None)
Agent followed the waypoint right. (rewarded 2.70)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.67)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'right')
Agent properly idled at a red light. (rewarded 2.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.13)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', 'right')
Agent followed the waypoint forward. (rewarded 1.57)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.24)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1077
\-------------------------
Simulating trial. . .
espilon = 0.0135; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.34)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.61)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.73)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.55)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.79)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.72)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1078
\-------------------------
Simulating trial. . .
espilon = 0.0135; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.73)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'right')
Agent drove forward instead of left. (rewarded 1.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.54)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.74)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1079
\-------------------------
Simulating trial. . .
espilon = 0.0134; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.47)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.74)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.33)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.49)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.41)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.81)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.57)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.81)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.79)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.00)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1080
\-------------------------
Simulating trial. . .
espilon = 0.0134; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.80)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.13)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.81)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.31)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.26)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1081
\-------------------------
Simulating trial. . .
espilon = 0.0133; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.55)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 1.53)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.10)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.03)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.34)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.26)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.14)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1082
\-------------------------
Simulating trial. . .
espilon = 0.0132; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.76)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.56)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.19)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'forward')
Agent drove right instead of left. (rewarded 1.37)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.61)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'left')
Agent followed the waypoint right. (rewarded 0.92)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.95)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.61)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.94)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.71)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 2.05)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.61)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.36)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.61)
15% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1083
\-------------------------
Simulating trial. . .
espilon = 0.0132; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 1.59)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.81)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.75)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.51)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.54)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.92)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.11)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.11)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.78)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.12)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.07)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.57)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.97)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent drove right instead of forward. (rewarded 0.58)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.57)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.45)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 0.20)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.39)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 0.57)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.45)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1084
\-------------------------
Simulating trial. . .
espilon = 0.0131; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.11)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.27)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.80)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent followed the waypoint forward. (rewarded 1.36)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.36)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.93)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.03)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.16)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.77)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.53)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded 0.35)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.25)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.76)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.21)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.27)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 0.96)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 0.79)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.75)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.07)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1085
\-------------------------
Simulating trial. . .
epsilon = 0.0131; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'right', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.55)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.42)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.39)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.84)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.25)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.40)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.49)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded -0.02)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.95)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.86)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.18)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.38)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.77)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.32)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.24)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent drove right instead of forward. (rewarded 1.18)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.45)
49% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1086
\-------------------------
Simulating trial. . .
epsilon = 0.0130; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.11)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.10)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.77)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.47)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent drove left instead of forward. (rewarded 1.38)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.29)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.29)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.92)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent attempted driving forward through a red light. (rewarded -10.11)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.89)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.35)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.17)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.20)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1087
\-------------------------
Simulating trial. . .
epsilon = 0.0130; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.27)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 1.10)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.57)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.98)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.95)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.01)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.78)
72% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1088
\-------------------------
Simulating trial. . .
epsilon = 0.0129; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.23)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'right')
Agent properly idled at a red light. (rewarded 1.21)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.97)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.72)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.03)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.28)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.36)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 2.72)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.49)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.34)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.33)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.20)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.39)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.75)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1089
\-------------------------
Simulating trial. . .
epsilon = 0.0129; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 0.53)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.69)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.65)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.40)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.13)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 0.95)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.12)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.58)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.80)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.51)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.20)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.88)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.51)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.07)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.30)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
37% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
33% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 0.80)
30% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.96)
27% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1090
\-------------------------
Simulating trial. . .
epsilon = 0.0128; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.38)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 2.22)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.27)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.79)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.55)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.23)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.34)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.66)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.65)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1091
\-------------------------
Simulating trial. . .
epsilon = 0.0128; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'right')
Agent drove right instead of left. (rewarded 1.38)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.98)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent idled at a green light with no oncoming traffic. (rewarded -5.84)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.02)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.12)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.50)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.29)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.97)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1092
\-------------------------
Simulating trial. . .
espilon = 0.0127; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'left')
Agent followed the waypoint left. (rewarded 1.97)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.98)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.53)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.40)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.90)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.95)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1093
\-------------------------
Simulating trial. . .
espilon = 0.0127; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', 'left')
Agent drove forward instead of right. (rewarded 0.43)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 0.46)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.80)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.42)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.96)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.49)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.06)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.39)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.58)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.85)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.58)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1094
\-------------------------
Simulating trial. . .
espilon = 0.0126; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.11)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.22)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.71)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.51)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.62)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.38)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.06)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.84)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1095
\-------------------------
Simulating trial. . .
espilon = 0.0126; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.64)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.77)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'left', 'forward')
Agent followed the waypoint right. (rewarded 2.12)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.51)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.80)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.94)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1096
\-------------------------
Simulating trial. . .
espilon = 0.0125; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.06)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.10)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.98)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.36)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.93)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 1.73)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1097
\-------------------------
Simulating trial. . .
espilon = 0.0125; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.55)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', None)
Agent followed the waypoint forward. (rewarded 1.85)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.74)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.45)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.56)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.09)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 1.34)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.04)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.28)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 0.94)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.65)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'forward')
Agent drove right instead of forward. (rewarded 0.55)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.05)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.41)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.09)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.55)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.80)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.64)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1098
\-------------------------
Simulating trial. . .
espilon = 0.0124; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.68)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 2.25)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'left')
Agent drove right instead of forward. (rewarded 0.75)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.82)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.13)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.56)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1099
\-------------------------
Simulating trial. . .
espilon = 0.0124; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.48)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.45)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.16)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', None)
Agent followed the waypoint left. (rewarded 1.91)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.52)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.19)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.40)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.31)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.39)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1100
\-------------------------
Simulating trial. . .
espilon = 0.0123; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.47)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.80)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.62)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 2.21)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.99)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1101
\-------------------------
Simulating trial. . .
espilon = 0.0123; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.75)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.44)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.48)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.99)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.32)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.37)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', None)
Agent drove right instead of forward. (rewarded 1.74)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.01)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'forward')
Agent drove right instead of left. (rewarded 0.64)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.22)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.53)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1102
\-------------------------
Simulating trial. . .
espilon = 0.0122; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.65)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'right')
Agent drove right instead of forward. (rewarded 1.60)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 2.68)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.08)
76% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1103
\-------------------------
Simulating trial. . .
espilon = 0.0122; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 2.31)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.94)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.07)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.42)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 1.00)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.25)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.92)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.42)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.02)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1104
\-------------------------
Simulating trial. . .
espilon = 0.0121; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'forward')
Agent drove forward instead of left. (rewarded 0.69)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.32)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.06)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.69)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.32)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.60)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.31)
64% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1105
\-------------------------
Simulating trial. . .
espilon = 0.0121; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 1.55)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 2.40)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.79)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.09)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.33)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.55)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.57)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.00)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.53)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1106
\-------------------------
Simulating trial. . .
espilon = 0.0120; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 2.66)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.92)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.67)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.46)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.96)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 2.63)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.75)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.33)
68% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1107
\-------------------------
Simulating trial. . .
espilon = 0.0120; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.40)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.02)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.08)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.32)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.49)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.30)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1108
\-------------------------
Simulating trial. . .
espilon = 0.0119; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.80)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'right')
Agent followed the waypoint right. (rewarded 1.81)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.16)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.63)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.20)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 0.90)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.18)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'right')
Agent drove right instead of left. (rewarded 0.72)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.37)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', None)
Agent followed the waypoint right. (rewarded 1.59)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.75)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'right')
Agent followed the waypoint right. (rewarded 1.78)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 0.69)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.84)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1109
\-------------------------
Simulating trial. . .
espilon = 0.0119; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', 'right')
Agent properly idled at a red light. (rewarded 2.37)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.73)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.35)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.11)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 0.95)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.48)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.35)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.40)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.49)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.42)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.20)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1110
\-------------------------
Simulating trial. . .
epsilon = 0.0118; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.06)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.62)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.23)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'left')
Agent drove right instead of left. (rewarded 0.24)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.81)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.70)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.37)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.29)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.49)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.83)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.39)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 1.22)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.70)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.23)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.75)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.15)
10% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1111
\-------------------------
Simulating trial. . .
epsilon = 0.0118; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.63)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.00)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.53)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.30)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.95)
65% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1112
\-------------------------
Simulating trial. . .
epsilon = 0.0117; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.13)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.98)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.46)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.71)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.05)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.34)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.47)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.19)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.48)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.89)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.92)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.59)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'forward')
Agent drove right instead of left. (rewarded -0.39)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.02)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 0.82)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.02)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1113
\-------------------------
Simulating trial. . .
epsilon = 0.0117; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', 'right')
Agent drove right instead of left. (rewarded 1.13)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', 'left')
Agent drove forward instead of right. (rewarded 0.41)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.89)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.10)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.19)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.84)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.14)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 0.95)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.61)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.27)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.14)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1114
\-------------------------
Simulating trial. . .
epsilon = 0.0117; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'left', 'right')
Agent drove right instead of left. (rewarded 0.37)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.04)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.39)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.35)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.01)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.44)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.24)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.52)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.98)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.15)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 0.80)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'right')
Agent drove forward instead of left. (rewarded 0.29)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.50)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.01)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.92)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.37)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.93)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1115
\-------------------------
Simulating trial. . .
epsilon = 0.0116; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.18)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'right')
Agent followed the waypoint forward. (rewarded 1.78)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.61)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', 'left')
Agent followed the waypoint forward. (rewarded 2.26)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.30)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.23)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'right')
Agent drove forward instead of left. (rewarded 1.30)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.93)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.17)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.76)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.69)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1116
\-------------------------
Simulating trial. . .
epsilon = 0.0116; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.31)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.08)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.13)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.67)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.46)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.97)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.98)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.00)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1117
\-------------------------
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.90)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.09)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.67)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.61)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.57)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.85)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.93)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.53)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.23)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.09)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1118
\-------------------------
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
Simulating trial. . .
espilon = 0.0115; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'right', 'left', 'left')
Agent followed the waypoint right. (rewarded 1.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.50)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.60)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.33)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.85)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'right')
Agent properly idled at a red light. (rewarded 1.86)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 2.11)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.97)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1119
\-------------------------
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.58)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.50)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.84)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.37)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 0.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 0.80)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', None)
Agent drove forward instead of left. (rewarded 1.58)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 1.18)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.21)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.53)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent idled at a green light with oncoming traffic. (rewarded 0.82)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.98)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.86)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.00)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded -0.25)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.53)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 0.56)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1120
\-------------------------
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
Simulating trial. . .
espilon = 0.0114; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.81)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.77)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.39)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.56)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.05)
70% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1121
\-------------------------
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.93)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent drove left instead of forward. (rewarded 1.20)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.20)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.44)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.95)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent drove right instead of forward. (rewarded 0.99)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 0.79)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent drove right instead of left. (rewarded 1.78)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.32)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 2.71)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.85)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.09)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.89)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.10)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.54)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.17)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.60)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.65)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.55)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.23)
20% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1122
\-------------------------
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
Simulating trial. . .
espilon = 0.0113; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 2.55)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.18)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.91)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.49)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving left through a red light. (rewarded -9.82)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.14)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.70)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.92)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.58)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.14)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 1.50)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.94)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.45)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 1.33)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.66)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.43)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.94)
24% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
20% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.83)
16% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.92)
12% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1123
\-------------------------
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
Simulating trial. . .
espilon = 0.0112; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'right', 'left')
Agent drove right instead of left. (rewarded 0.67)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.49)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 2.04)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.52)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 2.18)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.20)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.78)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.44)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.30)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.01)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.82)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.58)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.78)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent attempted driving left through a red light. (rewarded -10.76)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded -0.14)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'forward', None)
Agent followed the waypoint left. (rewarded 2.57)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 1.25)
28% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.83)
24% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1124
\-------------------------
Simulating trial. . .
epsilon = 0.0112; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.78)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.38)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', 'left')
Agent drove left instead of right. (rewarded 0.86)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 0.98)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.45)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.00)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.21)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.69)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.87)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.51)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.85)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.35)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1125
\-------------------------
Simulating trial. . .
epsilon = 0.0112; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.56)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.16)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.54)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.21)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.81)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.16)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.72)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.17)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 1.31)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.61)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 0.74)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.35)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.15)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.19)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'right', None)
Agent properly idled at a red light. (rewarded 2.29)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 0.54)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.21)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1126
\-------------------------
Simulating trial. . .
epsilon = 0.0111; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'right', 'left')
Agent followed the waypoint left. (rewarded 2.54)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.07)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.40)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.90)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.32)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1127
\-------------------------
Simulating trial. . .
epsilon = 0.0111; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.24)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.16)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.21)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.37)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', 'left')
Agent followed the waypoint forward. (rewarded 2.51)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.13)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.20)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.10)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.71)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.31)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.16)
44% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1128
\-------------------------
Simulating trial. . .
epsilon = 0.0110; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.24)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.68)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.88)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.09)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 1.62)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.16)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.13)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.67)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.87)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.98)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.36)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.03)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.63)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.28)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1129
\-------------------------
Simulating trial. . .
epsilon = 0.0110; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'forward')
Agent properly idled at a red light. (rewarded 2.28)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.94)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.64)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.44)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.37)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'forward')
Agent followed the waypoint forward. (rewarded 1.67)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.25)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 2.51)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.78)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.26)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 0.99)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.17)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.86)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1130
\-------------------------
Simulating trial. . .
epsilon = 0.0109; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.79)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.43)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.82)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.92)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.72)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.80)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.88)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.02)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', 'forward')
Agent drove right instead of forward. (rewarded 1.74)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.96)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.66)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.44)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.53)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 0.91)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 0.96)
43% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.57)
40% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 0.98)
37% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1131
\-------------------------
Simulating trial. . .
epsilon = 0.0109; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.50)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.97)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.18)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.59)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.44)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.13)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.06)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.22)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 0.86)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.52)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.52)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.70)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.88)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded -0.36)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.78)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.90)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.81)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.59)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Training trial 1132
\-------------------------
Simulating trial. . .
espilon = 0.0108; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.42)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.68)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.71)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.09)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.96)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.75)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.04)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.84)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.63)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.24)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.87)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 1.77)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 0.99)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.28)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.23)
54% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.65)
51% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, 'forward')
Agent drove right instead of left. (rewarded 1.40)
49% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.98)
46% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.50)
43% of time remaining to reach destination.
/-------------------
| Step 20 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.68)
40% of time remaining to reach destination.
/-------------------
| Step 21 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'left', 'forward')
Agent drove right instead of forward. (rewarded 1.63)
37% of time remaining to reach destination.
/-------------------
| Step 22 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'left')
Agent drove right instead of left. (rewarded -0.18)
34% of time remaining to reach destination.
/-------------------
| Step 23 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.51)
31% of time remaining to reach destination.
/-------------------
| Step 24 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
29% of time remaining to reach destination.
/-------------------
| Step 25 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.03)
26% of time remaining to reach destination.
/-------------------
| Step 26 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 0.98)
23% of time remaining to reach destination.
/-------------------
| Step 27 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.86)
20% of time remaining to reach destination.
/-------------------
| Step 28 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.40)
17% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1133
\-------------------------
Simulating trial. . .
espilon = 0.0108; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'left', None)
Agent properly idled at a red light. (rewarded 1.34)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.78)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.43)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.65)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.78)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.04)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.54)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.69)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.79)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.94)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1134
\-------------------------
Simulating trial. . .
espilon = 0.0108; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.10)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.59)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'right', None, 'forward')
Agent properly idled at a red light. (rewarded 1.55)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'forward')
Agent drove forward instead of left. (rewarded 1.75)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.05)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', 'left')
Agent drove forward instead of left. (rewarded 0.79)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.34)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.76)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.26)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.53)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.42)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -9.74)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.00)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.15)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.54)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.34)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 2.14)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.76)
5% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1135
\-------------------------
Simulating trial. . .
espilon = 0.0107; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.59)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'right')
Agent drove right instead of left. (rewarded 1.11)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'left')
Agent drove right instead of forward. (rewarded 1.53)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'left')
Agent drove right instead of left. (rewarded 0.63)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.09)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.02)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'forward', None)
Agent properly idled at a red light. (rewarded 0.98)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.22)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.49)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.45)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 0.84)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.15)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.05)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.00)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.12)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.73)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 1.57)
32% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, None)
Agent followed the waypoint right. (rewarded 1.21)
28% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1136
\-------------------------
Simulating trial. . .
espilon = 0.0107; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.20)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.10)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.31)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.11)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.69)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.80)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.27)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'left')
Agent drove right instead of forward. (rewarded 1.20)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, None)
Agent followed the waypoint left. (rewarded 1.23)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.80)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', 'right')
Agent properly idled at a red light. (rewarded 1.60)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 2.38)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.17)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'forward')
Agent followed the waypoint right. (rewarded 2.40)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.71)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.06)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1137
\-------------------------
Simulating trial. . .
espilon = 0.0106; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'left')
Agent drove right instead of forward. (rewarded 1.29)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', None)
Agent followed the waypoint left. (rewarded 1.40)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.52)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.40)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.69)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.24)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.27)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1138
\-------------------------
Simulating trial. . .
epsilon = 0.0106; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.97)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 2.82)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.35)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', None)
Agent followed the waypoint forward. (rewarded 1.34)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 2.83)
80% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1139
\-------------------------
Simulating trial. . .
epsilon = 0.0105; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.95)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.04)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.21)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.63)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.47)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'green', None, None, None)
Agent followed the waypoint right. (rewarded 1.55)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.47)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.90)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.15)
45% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1140
\-------------------------
Simulating trial. . .
epsilon = 0.0105; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', 'forward')
Agent followed the waypoint right. (rewarded 1.95)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.49)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.76)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 1.73)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 2.01)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.21)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'forward', 'right')
Agent drove right instead of forward. (rewarded 0.06)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 2.09)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.05)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.66)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 1.44)
48% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1141
\-------------------------
Simulating trial. . .
epsilon = 0.0105; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, 'left')
Agent followed the waypoint left. (rewarded 1.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.64)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.03)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.77)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.99)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.07)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.68)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.29)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.53)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.12)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'forward', 'right')
Agent drove forward instead of right. (rewarded 1.84)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, None)
Agent followed the waypoint right. (rewarded 2.36)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.75)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'red', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.75)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 1.48)
57% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 2.55)
54% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1142
\-------------------------
Simulating trial. . .
epsilon = 0.0104; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.96)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.61)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', None)
Agent properly idled at a red light. (rewarded 2.57)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.89)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.91)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('right', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'left')
Agent followed the waypoint right. (rewarded 2.66)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.70)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.97)
50% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1143
\-------------------------
Simulating trial. . .
epsilon = 0.0104; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.66)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'right', None)
Agent followed the waypoint forward. (rewarded 1.47)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.16)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.48)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.04)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.68)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.02)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'right')
Agent drove right instead of forward. (rewarded 1.88)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 1.52)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', None)
Agent followed the waypoint right. (rewarded 1.16)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.41)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.09)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.09)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1144
\-------------------------
Simulating trial. . .
epsilon = 0.0103; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', 'right')
Agent properly idled at a red light. (rewarded 1.50)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.64)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.87)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.10)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.36)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', None)
Agent properly idled at a red light. (rewarded 1.74)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.68)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.34)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.19)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.30)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.79)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.86)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, None)
Agent followed the waypoint forward. (rewarded 1.10)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.00)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.90)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.13)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.46)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1145
\-------------------------
Simulating trial. . .
epsilon = 0.0103; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.92)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.85)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.14)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.03)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 1.86)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'left')
Agent properly idled at a red light. (rewarded 2.31)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.86)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.88)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'right')
Agent followed the waypoint right. (rewarded 1.85)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1146
\-------------------------
Simulating trial. . .
epsilon = 0.0103; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.93)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.51)
94% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.66)
91% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.74)
89% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
86% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.88)
83% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.54)
80% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.75)
77% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.31)
74% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 1.64)
71% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.19)
69% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.80)
66% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.40)
63% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 0.89)
60% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', 'left', None)
Agent followed the waypoint right. (rewarded 1.09)
57% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1147
\-------------------------
Simulating trial. . .
espilon = 0.0102; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 1.24)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.48)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.88)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.90)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.03)
75% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1148
\-------------------------
Simulating trial. . .
espilon = 0.0102; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.82)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 2.58)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.32)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.15)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.38)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.86)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'left', None, 'forward')
Agent drove right instead of left. (rewarded 1.53)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'forward')
Agent followed the waypoint forward. (rewarded 2.82)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.16)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'right')
Agent drove right instead of forward. (rewarded 1.13)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', None)
Agent followed the waypoint left. (rewarded 0.82)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.31)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1149
\-------------------------
Simulating trial. . .
espilon = 0.0101; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'left')
Agent drove left instead of right. (rewarded 0.79)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.83)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.20)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.96)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.56)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.45)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.73)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.55)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 2.65)
55% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1150
\-------------------------
Simulating trial. . .
espilon = 0.0101; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'left', 'left')
Agent properly idled at a red light. (rewarded 1.06)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.53)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.25)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.07)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.15)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', None)
Agent properly idled at a red light. (rewarded 1.00)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', None, 'left', 'left')
Agent properly idled at a red light. (rewarded 2.44)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.94)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.24)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 0.97)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, 'forward', 'left')
Agent followed the waypoint left. (rewarded 2.11)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.11)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 0.86)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.48)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent attempted driving forward through a red light. (rewarded -10.61)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'right', 'left')
Agent followed the waypoint forward. (rewarded 1.48)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1151
\-------------------------
Simulating trial. . .
espilon = 0.0101; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 1.92)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', 'left')
Agent properly idled at a red light. (rewarded 2.49)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.68)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'right', 'left')
Agent properly idled at a red light. (rewarded 2.73)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.02)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.54)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.87)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', 'left')
Agent followed the waypoint left. (rewarded 1.50)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.18)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 0.97)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.36)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.39)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1152
\-------------------------
Simulating trial. . .
espilon = 0.0100; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'forward')
Agent drove right instead of left. (rewarded 0.16)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'left', None)
Agent followed the waypoint right. (rewarded 1.30)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', 'left')
Agent followed the waypoint forward. (rewarded 1.05)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 2.74)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.33)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 0.97)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 2.34)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.19)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', 'right')
Agent followed the waypoint forward. (rewarded 2.12)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 2.82)
56% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Training trial 1153
\-------------------------
Simulating trial. . .
espilon = 0.0100; alpha = 0.0040
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'right', None)
Agent drove forward instead of right. (rewarded 1.26)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.28)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'red', 'left', None, 'left')
Agent followed the waypoint right. (rewarded 2.66)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.89)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'left', 'forward', 'right')
Agent properly idled at a red light. (rewarded 1.55)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.42)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.76)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'green', 'right', None, 'left')
Agent followed the waypoint left. (rewarded 1.39)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent drove right instead of left. (rewarded 0.13)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.53)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('right', 'green', 'left', 'forward', None)
Agent drove left instead of right. (rewarded 0.97)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.47)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 1.95)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', 'left')
Agent drove right instead of left. (rewarded 1.44)
30% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('right', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.66)
25% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 2.05)
20% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.26)
15% of time remaining to reach destination.
/-------------------
| Step 17 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'right', 'left')
Agent drove right instead of forward. (rewarded 1.02)
10% of time remaining to reach destination.
/-------------------
| Step 18 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', None, None)
Agent followed the waypoint left. (rewarded 0.79)
5% of time remaining to reach destination.
/-------------------
| Step 19 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 2.10)
0% of time remaining to reach destination.
Trial Aborted!
Agent did not reach the destination.
/-------------------------
| Testing trial 1
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.33)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'left')
Agent properly idled at a red light. (rewarded 1.26)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.86)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.77)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.10)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.74)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.36)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.58)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.79)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.27)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 0.74)
35% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.52)
30% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 2
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', None, None)
Agent followed the waypoint left. (rewarded 1.23)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'left', None)
Agent followed the waypoint forward. (rewarded 1.94)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', 'forward')
Agent properly idled at a red light. (rewarded 2.97)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, 'forward')
Agent drove right instead of forward. (rewarded 0.18)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.04)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, 'left', None)
Agent followed the waypoint left. (rewarded 1.49)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'forward', None)
Agent drove right instead of left. (rewarded 0.63)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'left')
Agent drove right instead of forward. (rewarded 0.74)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'green', None, 'right', None)
Agent followed the waypoint left. (rewarded 1.08)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', 'right', 'forward', 'right')
Agent drove forward instead of left. (rewarded 0.30)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.59)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, None)
Agent followed the waypoint left. (rewarded 1.73)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 3
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 2.33)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.56)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.33)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', None)
Agent properly idled at a red light. (rewarded 1.86)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.95)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.83)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', None, 'right')
Agent followed the waypoint forward. (rewarded 2.47)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', 'left', 'forward', None)
Agent drove right instead of forward. (rewarded 0.00)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'forward')
Agent properly idled at a red light. (rewarded 1.85)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'forward')
Agent drove forward instead of left. (rewarded 0.98)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('left', 'green', 'forward', 'left', None)
Agent followed the waypoint left. (rewarded 1.59)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('right', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.97)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.69)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'forward')
Agent followed the waypoint right. (rewarded 1.71)
53% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 4
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.29)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.76)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.30)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 1.27)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.50)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', 'forward')
Agent followed the waypoint forward. (rewarded 1.51)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.66)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.02)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.37)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.63)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'right')
Agent followed the waypoint forward. (rewarded 1.49)
52% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, None)
Agent followed the waypoint right. (rewarded 2.45)
48% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', None, None)
Agent properly idled at a red light. (rewarded 1.33)
44% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.59)
40% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 1.09)
36% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.33)
32% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 5
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.60)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 1.57)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('right', 'green', 'right', None, 'forward')
Agent drove forward instead of right. (rewarded 1.69)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('right', 'green', None, 'left', None)
Agent followed the waypoint right. (rewarded 2.51)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', 'left', None)
Agent followed the waypoint forward. (rewarded 2.09)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.56)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('left', 'red', 'right', 'left', 'forward')
Agent drove right instead of left. (rewarded 0.13)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.45)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', None, None)
Agent followed the waypoint forward. (rewarded 1.04)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', None, None)
Agent properly idled at a red light. (rewarded 2.43)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.33)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 0.77)
40% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 6
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'red', None, 'forward', None)
Agent properly idled at a red light. (rewarded 2.06)
96% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.30)
92% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.96)
88% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.52)
84% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', 'forward', 'left')
Agent properly idled at a red light. (rewarded 1.84)
80% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.66)
76% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.36)
72% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('left', 'red', 'forward', None, 'left')
Agent properly idled at a red light. (rewarded 2.46)
68% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('left', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.54)
64% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.26)
60% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.66)
56% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 2.40)
52% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 7
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', None, 'right', None)
Agent followed the waypoint right. (rewarded 1.62)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.24)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.60)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.35)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.17)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.94)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 1.59)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 1.87)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 8
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'right', None)
Agent properly idled at a red light. (rewarded 1.71)
95% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'right', 'left', 'forward')
Agent properly idled at a red light. (rewarded 2.51)
90% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.72)
85% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.64)
80% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 0.94)
75% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, 'forward')
Agent properly idled at a red light. (rewarded 2.65)
70% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, 'forward')
Agent followed the waypoint forward. (rewarded 1.52)
65% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', 'right', None)
Agent properly idled at a red light. (rewarded 2.24)
60% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
55% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.62)
50% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.71)
45% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', 'forward', None, None)
Agent followed the waypoint forward. (rewarded 0.98)
40% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('right', 'green', None, None, 'left')
Agent followed the waypoint right. (rewarded 1.16)
35% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 9
\-------------------------
Simulating trial. . .
epsilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('right', 'green', 'left', None, 'forward')
Agent followed the waypoint right. (rewarded 1.13)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('right', 'green', 'forward', None, 'forward')
Agent followed the waypoint right. (rewarded 1.31)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'green', 'right', 'forward', None)
Agent followed the waypoint forward. (rewarded 2.43)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.87)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.47)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 2.32)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'forward', 'left')
Agent followed the waypoint forward. (rewarded 2.93)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.12)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'right')
Agent properly idled at a red light. (rewarded 2.82)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 2.83)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'green', None, 'left', None)
Agent followed the waypoint forward. (rewarded 1.83)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('left', 'green', None, None, 'left')
Agent followed the waypoint left. (rewarded 1.50)
60% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
/-------------------------
| Testing trial 10
\-------------------------
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
Simulating trial. . .
espilon = 0.0000; alpha = 0.0000
/-------------------
| Step 0 Results
\-------------------
Agent previous state: ('left', 'green', 'left', 'forward', None)
Agent followed the waypoint left. (rewarded 2.67)
97% of time remaining to reach destination.
/-------------------
| Step 1 Results
\-------------------
Agent previous state: ('forward', 'red', 'left', None, None)
Agent properly idled at a red light. (rewarded 1.91)
93% of time remaining to reach destination.
/-------------------
| Step 2 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.66)
90% of time remaining to reach destination.
/-------------------
| Step 3 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.73)
87% of time remaining to reach destination.
/-------------------
| Step 4 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.27)
83% of time remaining to reach destination.
/-------------------
| Step 5 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.45)
80% of time remaining to reach destination.
/-------------------
| Step 6 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'left')
Agent properly idled at a red light. (rewarded 1.11)
77% of time remaining to reach destination.
/-------------------
| Step 7 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'left')
Agent followed the waypoint forward. (rewarded 2.54)
73% of time remaining to reach destination.
/-------------------
| Step 8 Results
\-------------------
Agent previous state: ('right', 'green', None, 'forward', 'left')
Agent idled at a green light with oncoming traffic. (rewarded 1.60)
70% of time remaining to reach destination.
/-------------------
| Step 9 Results
\-------------------
Agent previous state: ('right', 'red', 'left', 'forward', None)
Agent followed the waypoint right. (rewarded 1.59)
67% of time remaining to reach destination.
/-------------------
| Step 10 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, None)
Agent properly idled at a red light. (rewarded 1.23)
63% of time remaining to reach destination.
/-------------------
| Step 11 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, None)
Agent followed the waypoint forward. (rewarded 2.07)
60% of time remaining to reach destination.
/-------------------
| Step 12 Results
\-------------------
Agent previous state: ('forward', 'red', 'forward', 'forward', None)
Agent properly idled at a red light. (rewarded 1.81)
57% of time remaining to reach destination.
/-------------------
| Step 13 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 2.21)
53% of time remaining to reach destination.
/-------------------
| Step 14 Results
\-------------------
Agent previous state: ('forward', 'red', None, 'left', 'forward')
Agent properly idled at a red light. (rewarded 1.44)
50% of time remaining to reach destination.
/-------------------
| Step 15 Results
\-------------------
Agent previous state: ('forward', 'red', None, None, 'forward')
Agent properly idled at a red light. (rewarded 1.65)
47% of time remaining to reach destination.
/-------------------
| Step 16 Results
\-------------------
Agent previous state: ('forward', 'green', None, None, 'forward')
Agent followed the waypoint forward. (rewarded 2.21)
43% of time remaining to reach destination.
Trial Completed!
Agent reached the destination.
Simulation ended. . .
# Load the 'sim_improved-learning' file from the improved Q-Learning simulation
vs.plot_trials('sim_improved-learning.csv')
Using the visualization above, produced from your improved Q-Learning simulation, provide a final analysis and make observations about the improved driving agent, as in Question 6. Questions you should answer:
Answer:
1 - An exponential decay function was used for exploration, epsilon = e^(-at), with epsilon starting at 1.00.
2 - The driving agent needed 1153 training trials before it moved on to the testing phase.
3 - An epsilon tolerance of 0.01 and an alpha of 0.004 were used. Alpha was set low enough to allow enough training trials to raise the safety rating.
4 - Compared with the driving agent from the previous section, we observed the following results:
5 - The Q-Learning results show that the trained driving agent learned an appropriate policy.
6 - Yes.
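The relationship between the decay function and the number of training trials can be checked with a short calculation. The sketch below assumes that the decay constant `a` in epsilon = e^(-at) is the same 0.004 quoted as alpha in the answer (the answer does not state this explicitly), and the function names are illustrative only.

```python
import math

def epsilon_at(trial, a):
    """Exploration factor after `trial` training trials: e^(-a * trial), starting at 1.0."""
    return math.exp(-a * trial)

def trials_until_tolerance(a, tolerance):
    """Smallest whole trial count t for which e^(-a * t) <= tolerance."""
    return math.ceil(math.log(1.0 / tolerance) / a)

a = 0.004          # assumption: decay constant equal to the alpha quoted in the answer
tolerance = 0.01   # epsilon tolerance quoted in the answer

n = trials_until_tolerance(a, tolerance)
print(n, epsilon_at(n, a))
```

Under these assumptions the threshold is crossed at roughly 1152 trials, which is in line with the 1153 reported above; the exact count depends on when the simulator increments the trial counter and checks the tolerance.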
Sometimes the answer to the important question "what am I trying to get my agent to learn?" is only theoretical and cannot be stated concretely. Here, however, you can concretely define what the agent is trying to learn: the (U.S.) traffic laws. Since these laws are known information, you can define, for every state the Smartcab occupies, the optimal action for the driving agent based on them. In that case, we call the set of optimal state-action pairs the optimal policy. Therefore, unlike some theoretical answers, we can clearly judge whether the agent is acting "incorrectly" not only by the reward (penalty) it receives, but also by observing its behavior. If the agent drives through a red light, we both see it receive a negative reward and know that this is not the correct behavior. This can be used to your advantage to verify whether the policy your driving agent has learned is the correct one, or whether it is a suboptimal policy.
Give a few examples (using the states you have defined) of what an optimal policy for this problem would look like. Afterwards, investigate the text file 'sim_improved-learning.txt' to see the results of your improved Q-Learning algorithm. For each state recorded in the simulation, is the learned policy (the action with the highest value) correct? Are there any states where the policy differs from what would be expected of an optimal policy? Give an example of a state and all the state-action rewards recorded, and explain why this would be the correct policy.
Answer:
Examples of optimal policies for this problem are given below:
| Policy | Action | State |
|---|---|---|
| optimal | None | ('forward', 'red', None, None, None) |
| optimal | forward | ('forward', 'green', None, None, 'forward') |
The file 'sim_improved-learning.txt' contains the following entries, which are consistent with the optimal policies exemplified above.
('forward', 'red', None, None, None)
-- forward : -3.22
-- right : 0.19
-- None : 1.80
-- left : -3.06
('forward', 'green', None, None, 'forward')
-- forward : 1.11
-- right : 0.08
-- None : -0.35
-- left : -1.54
The following case with a suboptimal policy was also found:
('forward', 'red', 'right', 'right', 'left')
-- forward : 0.00
-- right : 0.00
-- None : 0.05
-- left : 0.00
Since in the U.S. a vehicle may turn right at a red light but may never turn left, the Q-value for the 'left' action should be negative in this state.
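The comparison between the learned policy and the expected one can be sketched in a few lines. The Q-values below are copied from the three states quoted above; `expected_action` is a hypothetical, deliberately simplified rule (idle on red, follow the waypoint on green) that ignores right-on-red and the other vehicles, so it is only meant for states like these, where the waypoint is 'forward'.

```python
# Q-values copied from the three states quoted above.
q_table = {
    ('forward', 'red', None, None, None):
        {'forward': -3.22, 'right': 0.19, None: 1.80, 'left': -3.06},
    ('forward', 'green', None, None, 'forward'):
        {'forward': 1.11, 'right': 0.08, None: -0.35, 'left': -1.54},
    ('forward', 'red', 'right', 'right', 'left'):
        {'forward': 0.00, 'right': 0.00, None: 0.05, 'left': 0.00},
}

def greedy_action(action_values):
    """Return the action with the highest Q-value for one state."""
    return max(action_values, key=action_values.get)

def expected_action(state):
    """Hypothetical simplified rule: idle on red, follow the waypoint on green."""
    waypoint, light = state[0], state[1]
    return waypoint if light == 'green' else None

# Learned (greedy) policy and the states where it disagrees with the simplified rule.
policy = {state: greedy_action(values) for state, values in q_table.items()}
mismatches = [s for s, a in policy.items() if a != expected_action(s)]

for state, action in policy.items():
    print(state, '->', action)
print('mismatches:', mismatches)
```

A greedy check like this only looks at the top-ranked action in each state. It does not flag a state whose remaining actions still sit at 0.00, which is the concern raised for the third state.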
'gamma'
Curiously, as part of the Q-Learning algorithm, you were asked not to use the discount factor, 'gamma', in the implementation. Including future rewards in the algorithm helps propagate positive rewards backwards from a future state to the current state. Essentially, if the driving agent is given the option to take several actions and arrive at different states, including future rewards will bias the agent towards states that could provide even more rewards. An example of this would be a driving agent moving towards a goal: with all actions and rewards being equal, moving towards the goal would theoretically yield better rewards if there were an additional reward for reaching it. However, even though in this project the driving agent is trying to reach a destination within a set deadline, including future rewards would not benefit the agent. In fact, if the agent were given many trials to learn, it could even affect the Q-values negatively!
There are two characteristics of the project that invalidate the use of future rewards in the Q-Learning algorithm. One characteristic has to do with the Smartcab itself, and the other has to do with the environment. Can you figure out what they are and why future rewards won't work for this project?
Answer:
For the Smartcab, the main objective is to learn to obey the traffic rules and reach the destination safely and reliably. This can be learned from immediate rewards alone.
The environment states are independent: the step the Smartcab chooses does not affect the environment it encounters in the following steps. The only state feature that may depend on the previous action is the 'waypoint'; even so, adding future rewards can negatively affect the Q-values.
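The effect of dropping 'gamma' can be seen in the standard Q-Learning update rule. The sketch below is a generic form of that rule, not the code from agent.py; the function and parameter names are illustrative. With gamma set to 0, the future term vanishes and each Q-value becomes a running average of the immediate rewards observed for that state-action pair.

```python
def q_update(q_old, reward, max_q_next, alpha, gamma=0.0):
    """Standard Q-Learning update.

    q_old      -- current estimate Q(s, a)
    reward     -- immediate reward received for taking a in s
    max_q_next -- highest Q-value available in the next state
    alpha      -- learning rate
    gamma      -- discount factor; 0.0 ignores future rewards entirely
    """
    return (1 - alpha) * q_old + alpha * (reward + gamma * max_q_next)

# With gamma = 0 the next state's value has no influence on the update.
print(q_update(q_old=0.0, reward=2.0, max_q_next=10.0, alpha=0.5))
# With gamma > 0 the next state's value is mixed into the estimate.
print(q_update(q_old=0.0, reward=2.0, max_q_next=10.0, alpha=0.5, gamma=0.5))
```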
Note: Once you have completed all the code implementations and successfully answered all the questions above, you can finalize your work by exporting the iPython notebook as an HTML document. You can do this using the menu above and navigating to File -> Download as -> HTML (.html). Include the finished document along with this notebook as your submission.
To run docker container with pygame
xhost +
docker run -v $PWD:/tmp/working -v /tmp/.X11-unix:/tmp/.X11-unix -e DISPLAY=unix$DISPLAY -w=/tmp/working -p 8888:8888 --rm -it continuumio/anaconda jupyter notebook --no-browser --ip=* --notebook-dir=/tmp/working --allow-root
!pip install pygame
https://github.com/diyjac/smartcab/blob/master/smartcab.ipynb